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MODIFIED BACTERIAL SURFACE LAYER PROTEINS 

Introduction 

The present invention relates to modified bacterial surface layer (S-proteins) 
that can be used in vaccines (when expressed on the surface of a cell), molecular 
sieves and sensors, and bacteria expressing such S-proteins. The S -layer proteins 
have inserted into them, at an internal position, one or more heterologous (or 
functional) polypeptides. Such a polypeptide may be an antigen or an antibody, or a 
portion thereof. Other modifications include the truncation of the carboxy (C) or 
amino (N) terminus to form fragments. 

Backgi'oimd 

Bacterial surface layer proteins are known to exist in many different types of 
bacteria, and often assemble into crystalline layers at the cell surface. However, the 
exact function of these proteins is not always known, although they are thought to 
play a role in bacterial adhesion to other cells, or to act as a scaffold for enzymes. 

Lactobacillus is one of the genera belonging to the group of lactic acid 
bacteria (LAB) which are used in the manufacture of fermented foods and beverages, 
and are generally considered to be safe. Bacteria of the genus Lactobacillus can 
protect humans against infection, and this property (and their ability to colonise 
mucosal surfaces) has resulted in the use of Lactobacilli as vaccine delivery vehicles 
for oral immunisation. The adhesion to the mucosa is thought to be a pre-requisite 
for the survival and establishment in the gastrointestinal tract (GIT). Some surface 
located molecules have been identified as adhesions that specifically interact with 
receptor moieties in the internal tissue. 

The surface layer of proteins self-assembles into multimeric units to form an 
array covering the entire cell as the outermost envelope. They are single protein 
molecules and form crystalline monolayers. However, the adhesive properties of S- 
layers, especially in probiotic Lactobacilli, remains poorly characterised. 
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S-layers have been found in several Lactobacillus species, and although the 
function of these S-proteins is not known, they are thought to play a role in cell 
adhesion. S-layer proteins of Lactobacillus are amongst the smallest S-proteins 
known, and are different from most other bacterial S-proteins because they are highly 
basic (pi of at least 9) instead of acidic proteins. 

Several S-layer proteins have been sequenced. These include the rsaA gene 
of C. crescentus (US 5500353, Smit et al), and Lactobacillus acidophilus (Boot et al 9 
1993 and Smit et al, 2001). Although the sequence of the S-protein is given, the 
secondary and tertiary structure of the protein cannot be predicted simply from 
knowing the amino acid sequence. Indeed, from that document it is impossible to 
predict, for example, which amino acid sequence might be exposed to the surface, 
and which amino acids might be present in the S-layer. 

In addition, the cbsA gene encoding the collagen binding S-layer protein of 
Lactobacillus crispatus has been expressed in i. casei. (Martinez et al, Journal of 
Bacteriology 182 (23): 6857-6861, December 2000). 

As the structure and role of surface layer proteins is not well understood, 
there is therefore a need to determine a structure/function relationship, in order to 
allow the production of modified surface layer proteins which have improved 
properties over the wild-type. Surface layer proteins represent a large proportion of 
protein in the cell (as much as 10 to 12%) and may therefore be used to express large 
quantities of a heterologous protein. In addition there is a need for more effective 
(oral) vaccines, in particular those based on bacteria (or Lactobacillus) where a 
greater amount or higher density of antigen can be presented on the cell, or may be 
better presented at the cell surface. 
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A first aspect of the present invention therefore relates to a (e.g. modified) 
bacterial surface layer (S-layer) protein, the modification comprising the insertion, 
internally, of a heterologous polypeptide. Such a polypeptide may be a functional 
one, and can be any polypeptide of interest. 
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The modification is the insertion of the polypeptide at an internal location. 
That is to say, the insertion is at an internal sequence, which is of course different 
from the addition (or fusion) of a polypeptide at either the carboxy (C) or amino (N) 
terminus. The insertion may take place at least 5, 10, 20, 30 or 40 amino acids from 
either end of the protein. 

The polypeptide may be inserted (in the sense that it slots in between two 
adjacent amino acid residues) rather than replacing any residues (where residues are 
deleted or substituted). 

Location of inserted heterologous polypeptide 

The polypeptide may be inserted at a location whereby the polypeptide is 
exposed, or away from the cell surface when the surface layer protein is expressed (at 
the surface) of a bacterial cell. The heterologous polypeptide may therefore be 
exposed in the sense that it can be targeted or bound by a compound or other 
polypeptide (for example an antibody). This may be possible if the polypeptide is in 
a portion of the S-layer protein which is involved in or responsible for crystallisation 
and/or assembly, for example is part of the crystallisation or assembly domain. 

However, the alternative is possible, where the heterologous polypeptide is 
not exposed, for example it is located (at least partly in) in the cell surface, or in the 
S-layer. This may be possible by inserting the heterologous peptide at a location at 
or near a cell waU anchor or is in an attachment or binding domain. This part of the 
molecule may be in the S-layer. In such case the heterologous peptide may be 
present or buried in the surface layer. It may thus be protected from the outside 
environment, for example shielded from proteolytic processing. It may not be bound 
or recognised by external antibodies or shielded from external degradation of 
proteolytic attack. This may be advantageous if the protein is to be located on a 
bacterial cell wall, and then administered to a human, as it may mitigate or avoid 
degradation, or proteolytic processing, by enzymes in the gastrointestinal tract (GIT). 

For the S-layer protein of Lactobacillus acidophilus, two main domains have 
been identified. The first N-terminal region, which constitutes about two thirds of 
the molecule, is involved in crystallisation and assembly (and is called SAN)- It 
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constitutes amino acids 1 to 290 and effectively forms the S-layer above the cell 
wall. The second main domain, or last third of the molecule, amino acids 290 to 412, 
represents the cell wall anchor (and is called SAC). This portion of the molecule is 
buried in the S-layer. 

The N-terminal assembly region can in itself be split into three main portions; 
1 to 1 14 (involved in forming dimers and timers); 150 to 290 (involved in forming 
dimers); these flank an intervening domain (about 115 or 120 to 150 or 160) that 
appears to be loop region (or a surface (immunodominant) loop) that is exposed and 
can be targeted by antibodies. 

Therefore, if the heterologous polypeptide is to be exposed, then the insertion 
can be at a location of from any of amino acids 1 to 290, and preferably from 115 or 
120 to 150 or 160 (as this appears to be a surface exposed loop region). The N- 
terminal crystallisation section or assembly region may be primarily hydrophobic, 
such as inZ. acidophilus. This is unexpected, and counter-intuitive, given that this is 
a region that is (surface) exposed, and present in the environment outside the 
bacteria, which is necessary in order for the S-layer proteins to assemble and form a 
layer. 

The cell wall anchor may be equally unusual, in that it is primarily 
hydrophilic, despite being present in the S-layer. (Cell wall binding regions are 
usually hydrophilic as they bind negatively charged groups in the cell wall). The cell 
wall anchor region can alternate between hydrophilic and hydrophobic regions in 
some S-proteins. The protein may thus be inserted in a hydrophobic region (e.g. for 
surface exposure) even though such a region would not usually be exposed on the 
surface. 

The above two paragraphs relate mainly to Lactobacillus proteins, such as 
from L. acidophilus. For many other (S-layer) proteins the reverse is true, and in this 
case the N-terminal domain can be hydrophilic while the C-terminal may be 
hydrophobic. 

If the heterologous polypeptide is to be exposed, then preferably it is located 
at a position from 100 to 150 or 160, such as from 1 10 to 140 or 150, preferably from 
120 to 130 or 140, optimally at around position 125. Preferably the polypeptide is 
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located in a region of the S-layer protein that is exposed or is in a surface (or 
immunodominant) loop. 

Note that the numbering of residues in this specification is (unless the context 
requires otherwise) based on the numbering of the original, (mature) wild-type, 
protein. Another region for exposure of the heterologous protein is from 130 to ISO, 
such as from 140 to 170, preferably from 150 to 160, optimally at around position 
156. An additional location for exposing the polypeptide, although to a lesser 
degree, is at one of the positions from 20 to 80, such as from 30 to 70, preferably 
from 40 to 50, optimally at around position 45. 

If the heterologous polypeptide is not to be exposed, or is to be at least 
partially present inside the S-layer, then the insertion can take place at a different 
location in the unmodified S-layer protein. This may be from position 1 to 30, such 
as from 3 to 20, preferably from 5 to 10, and optimally at about position 7. 

Suitably the heterologous polypeptide is inserted into the protein, rather than 
replacing any part of the protein sequence. In that way, most of (eg. at least 70, 80, 
S5, 90 or even 95%) or the entire or full sequence of the (wild-type or original) 
protein can be retained. Hence, there may be no deletion or replacement of amino 
acid residues (or if there are, there are no more than 5, 10, 15 or 20 deletions or 
replacements). This may help the modified protein to retain as many of the 
properties or functions of the original molecule as possible. The modified protein 
may be at least as long, if not longer, than the unmodified (wild-type) protein. 

Modified Proteins 

The following comments relate to both the modified (of the invention) and 
unmodified (wild type) proteins, unless the context requires otherwise. The protein 
preferably has a minimum size of 30, 40 or 50 kDa. It may have a maximum size of 
70, 80 5 100, 150 or even 200 or 300 kDa. Preferred proteins are from 40 to 50, 60, 
70 or 80 kDa in size. The protein may be able to crystallise, or with other proteins 
form a sheet or crystalline monolayer (e.g. in vitro). They may be able to self- 
assemble, for example into multimeric units. If possible, the protein may be able to 
form (with other proteins) an oblique lattice (for example of pi or p2 symmetry). 
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The protein may have at least two domains or. functional regions. One may 
be involved in or responsible for crystallisation or assembly. This region may be 
exposed, and away from die cell membrane. The protein may therefore have a (e.g. 
surface exposed) loop region. The protein may have a cell wall anchor or cell wall 
binding region. This may be wholly or partly present in the S-layer. 

Preferably the modified protein will not comprise (or retain) a (eg. S-layer) 
secretion signal (this may have been cleaved during processing or expression). It 
may have a (C-tenninal or cell wall) anchor region, which may be the original region 
from the S-layer protein or it may be (in addition) such a region from another S-layer 
protein, such as of a different species (preferably from the species in which the 
modified protein is to be expressed). 

The (unmodified or wild-type) protein may be from a lactic acid bacteria, 
such as from Lactobacillus. Preferably, the protein is from L. acidophilus, L. 
crispatus, L. helveticus, L. amylovorus, L. gallinarum and/or L. brevis. Preferably 
the protein is predominantly basic. Suitably, the protein is not from a Gram negative 
or aquatic bacteria, for example of the group Caulobacter, such as from C. 
crescentus, or from the family Campylobacteraceae (eg. those belonging to 
Campylobacter). 

The protein may be expressed in a bacterial host. If it is expressed on the 
surface, such as in the S-layer, then bacterial host cell may be used in a vaccine. The 
heterologous polypeptide may be exposed, e.g. to the environment. If however the 
polypeptide is to survive inside the body (or be shielded from the environment or 
from degradation or proteolytic attack), for example in part of the GIT, then the 
heterologous peptide may be partially or wholly present or buried in the S-layer. 

Preferably the protein has a pi of at least 6, 7 or 8. Optimally however it has 
a pi of at least 9, such as at least 9.4. 

hisertion Siies 

The location of the inserted polypeptide may effect crystallisation. Modified 
peptides which crystallise, or do not crystallise, can both be prepared using the 
invention. The position of insertion of the polypeptide may be at from 1 to 290, such 
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as at any of the following locations (if for example the protein is desired to be able to 
crystallise): 

(1) at from 1 to 20, such as 3 to 15, preferably from 5 to 10, optimally at 
about position 7; 

(2) at from 35 to 55, such as 40 to 50, optimally at about position 45; 

(3) at from 100 to 130, such as from 110 to 120, optimally at about 
position 114; and/or 

(4) at from 1 10 to 140, such as from 120 to 130, optimally at about 
position 125. 

The heterologous polypeptide can be inserted at position above residue 290, 
such as from 291 to 400, 425 or 450 (eg. about 412). If the protein is not to be 
capable of crystallising then the insertion may be: 

(1) at from 20 to 40, such as 25 to 35, optimally at about position 30; 

(2) at from 50 to 80, such as at 60 to 70, optimally at about position 66; 

(3) at from 70 to 100, such as from 80 to 94, preferably from 85 to 90, 
optimally at about position 8S; and/or 

(4) at from 1 15 to 150 or 140 to 180, such as from 150 to 170 or 160 to 
180, preferably from 150 to 160 or 170 to 180, optimally at about position 
156 or 177. 

An insertion at above position 290 may not affect crystallisation as this may 
not be in a surface or exposed position (for example it may be in a cell wall anchor 
region). The insertion may thus be: 

(5) at from 320 to 41 0, such as from 330 to 370, preferably from 340 to 
360, optimally at about position 349. 

There may be one or more heterologous polypeptides, inserted at one or more 
different sites. 

Sensors and Molecular Sie\>es 

The protein or bacterial host cell can be employed in sensors, for example 
included in a coating. They may also find use in molecular sieves. For example, a 
plurality of bacteria, expressing a modified protein, can be arranged to form a hole or 
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to define a pore. By changing the location and nature of the heterologous 
polypeptide, one can change not only the protein size and characteristics but also the 
confirmation of the protein. This is turn may change the pore or hole characteristics, 
for example in certain parameters such as affinity or charge. 

A multiplicity of proteins may be able to crystallise or form 2-dimensional 
arrays on a (e.g. solid) surface (e.g. comprising silicon, polymer(s) or metal(s)) or an 
interface (e.g. a liposome or planar lipid film). An S-layer may be used as a template 
in the fomiation of an inorganic (e.g. nanocrystal) lattice (e.g. comprising CdS 3 Au, 
Ni 5 Pt or Pd), such as hi electronics or non-linear optics. 

The functional domains on each S-protein may be repeated with the 
periodicity of the S-layer lattice. This may allow the formation of regular arrays of 
bound molecules or particles. 

The (modified) S-proteins of the invention may interact with each other, and 
if appropriate, with a supporting cell envelope layer (such as plasma membrane, 
outer membrane or peptidoglycan). The protein may have a high proportion of non- 
polar amino acids, so that hydrophobic interactions play a role in assembly. 
Preferably, a multiplicity of S-proteins have the capability to reassemble, for 
example in suspension, at a liquid air interface, at a solid surface, at a floating lipid 
monolayer or on (the outside of) a liposome. The S-proteins may therefore form a 
monolayer, at a surface or an interface, and this allows them to be used in 
nanobiotechnological applications. 

A further aspect of the invention therefore relates to a sheet or (optionally 
crystalline) monolayer or two dimensional array comprising a polarity of bacterial 
surface layer proteins. At least one of such proteins is preferably a modified protein 
according to the first aspect. 

The invention therefore stems, in a further aspect, to a solid surface, liquid-air 
interface, lipofilm, liposome or solution comprising a sheet, monolayer or array of 
the previous aspect. 

Preferably, a multiplicity of the S-proteins may be disintegrated by a 
chao tropic agent, by lowering or raising the pH value, or by addition of a chelating 
agent or cation substitution. Preferably, after removal of the disrupting agent, the 
proteins will reassemble into a flat sheet, an open-ended cylinder or a closed vesicle. 
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Any of these arrays may be a double layer, or bi-layer, for example with two 
constituent monolayers facing each other (either with their inner or outer side). Bi- 
layers may, if necessary, contain divalent cations interacting with polar groups. 

The S-proteins may be able to reassemble at an air-water interface, on a solid 
support, or in a lipid film. In this way, a (randomly aligned) monocrystalline S-layer 
may be formed. If formation takes place at a liquid-solid interface, then the solid 
may comprise silicon, such as a silicon wafer. The silicon wafer may have a native 
oxide layer, it may be silanised, or it may be coated with a photoresist and/or carbon 
film. Other solids can be employed, such as silicone nitride, galium arsenide, noble 
metals (gold or platinum), glass, cellulose, graphite or mica. 

Preferably, the S-layer can be patterned using deep ultra-violet (DUV) 
radiation. The S-layer, such as on a silicon wafer, may be brought into (direct) 
contact with a photomask, and exposed to DUV radiation. The S-layer may be 
removed from the silicon wafer or substrate at the exposed areas. Thus, a pattern on 
the photomask may be transferred into the S-layer. 

The S-layer protein (or sheet or array) may be used to immobilise a 
biologically active (macro) molecule. Molecules such as enzymes, antibodies, 
antigens and/or ligands may be covalently immobilised to an S-protein (mono) layer. 

A layer of S-proteins may therefore act as a surface linking layer. The 
S-protein layer may be inbetween, or sandwiched between functional molecules and 
a solid surface or support. The functional molecule may be an enzyme, antibody (or 
fragment thereof), or other binding molecule, a receptor, antigen or ligand, any of 
which may act as a bioreceptor (such as for a target, in a sample). The S-layer 
proteins may be self-assembling, and they may be capable of recrystallising at the 
surface. They may thus provide a linking layer between the solid surface and the 
functional molecules. This type of arrangement may be used in a (bio) sensor, and 
the invention therefore relates to sensors, molecular sieves and ion traps comprising 
an S-protein layer of the invention. 
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Fragments of Surface Layer Proteins 

The invention additionally relates to modified proteins that are fragments of 
bacterial S-layer proteins. The fragments may be N-terminal fragments. 
Alternatively or in addition, the fragments may be capable of forming trimers, for 
example with two other such fragments. The fragment may be able to form 
multimers (e.g. trimers) with different fragments, or indeed the wild type molecule as 
well. These fragments can be thought of as C-terminal deletions because, as they 
retain a portion of the N-temiinus, they can be prepared by, in effect, deleting or 
removing all or a portion of the C-terminus. 

The fragments may be capable of forming dimers, preferably with another 
such fragment (although the formation of dimers with other fragments or the full size 
wild-type molecule may also be possible). Such fragments may include the 
(immunodominant or exposed) loop region. The fragment may then be from 20 to 
200 amino acids long. Another type of fragment may be one that excludes the entire 
(immunodominant or exposed) loop region. This may allow for at least part of the 
loop region to be present. Such fragments may be from 20, 30 or 40 to 100, 130 or 
155 amino acids long. 

The loop region may be a region that, in the full length of wild-type 
molecule, is exposed to the environment when expressed on the surface of a 
bacterium. It may be exposed in the sense that it may be bound or targeted by an 
(external) antibody specific for that region. The N-teiminal fragments or those that 
are capable of trimerisation may include part or all of the loop region. The fragments 
may thus be from 10 to 30, 40 or 50 amino acids long. For certain bacteria, such as 
L. acidophilus, this region may be from 20 to 40 amino acids in length, such as from 
32 to 38 amino acids. 

Preferably the fragment is at least 40, 60, SO, 120, 130 or at least 140 amino 
acids in length. The fragments may be up to 170, 150, or 120 amino acids in length. 

If the fragments are fragments of an S-layer protein from Lactobacillus, such 
as L. acidophilus, the fragments are suitably fragments of the assembly or 
crystallisation region, know as SAN in the art, namely residues 1 to 290. 
Particularly preferred fragments (using the numbering of the residues of 
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L. acidophilus S-layer protein) are 1-159, 1-149, 1-114, 115-290, 139-290 and 
150-290. Preferably the fragments will not contain any part of SAC or the cell wall 
anchor region. 

Novel (Bacterial) Cells 

A second aspect of the invention relates to a (e.g. modified) bacteria, or 
bacterial cell, which has been modified to express a heterologous surface layer 
(S-layer) protein. Preferably, the bacterium is other than an L. casei cell or a 
Bacillus (e.g. B. sphaeiicus or B. brews) cell. Preferably the bacterial cell is 
however Lactobacillus cell or from the family Lactobacillaceae. The S-layer protein 
may have its own (or original) cell wall anchor. In other words, the cell wall anchor 
is homogenous to the S-layer protein, or the protein retains its natural cell wall 
anchor domain, or region. Preferably, the S-layer protein has a cell wall anchor 
which is from the same bacterial species as the S-layer protein. Such a bacteria, as a 
wild-type, or in unmodified form, may not possess such a surface layer. 

Preferably the (host) cell expresses, or contains a gene encoding, from 1 to 6 
different surface layer proteins (either naturally, as a wild type, or in a modified 
form). 

The S-layer protein is preferably one of the modified bacterial surface layer 
proteins of the first aspect although it need not be. For example, the bacterial cell 
may be one which does not (normally) have an S-layer, (for example, Lactobacillus 
reuteri, Lactobacillus fennentum, Lactobacillus salivarius, Lactobacillus animalis, 
Lactobacillus gasseri, Lactobacillus johnsonii, Lactobacillus murinus, Lactobacillus 
sakei, Lactobacillus curvatus, Lactobacillus delbi-ueclcii subsp bulgaricus, 
Lactobacillus delbriieclcii subsp lactis, Lactobacillus plantarum, Lactobacillus 
peniosus, Lactobacillus caseL Lactobacillus zeae, Lactobacillus paracasein 
Lactobacillus rhamnosus or a bacteria from Lactococcus) but in modified form does 
have an S-layer (or at least carries an (e.g. modified) S-protein at its surface). The 
S-layer may thus comprise a naturally occurring surface layer protein, albeit one that 
is heterologous to the bacteria. For example, the bacteria may be L. caseL which 
does not possess an S-layer, but which has been modified to express surface layer 
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proteins from another bacteria, such as from a lactic acid bacteria or Lactobacillus, 
for example L. acidophilus. 

A third aspect relates to an L. acidophilus cell which expresses (e.g. on the 
surface of the cell) a heterologous surface layer protein. In this case, L. acidophilus 
does naturally possess an S-layer, and so the S-protein may be a modified one, or one 
from a (bacterial) species other than Z,. acidophilus, or both. 

The surface layer protein may thus be a naturally occurring or wild-type 
protein, or it may be a modified surface layer protein for example one of the present 
invention, as described in the first aspect, or a mixture of both. Preferably the 
S-layer protein is antigenic and/or induces an immune response. The protein is 
suitably not a part of a (e.g. collagen) binding surface layer protein, for example of 
Z. crispatus. 

Preferably the number of S-layer proteins is sufficient to form an S-layer, and 
preferably an S-layer that covers the entire cell. Hence the number of S-proteins at 
the cell surface may be at least 1, 2 or 5xl0 5 S-layer proteins (or monomers), 
preferably at least 10 6 , optimally at least 5xl0 6 monomers. 

Preferably the S-layer protein, even if modified, retains the cell wall anchor 
of the unmodified or wild-type protein. However, it may have a heterologous anchor 
or binding region or domain, and so may be a fusion protein. This may attach the 
protein to the S-layer, cell wall, lipid bilayer, cytoplasmic membrane, peptidoglycan 
or to a cell wall component. Whatever form of linkage, the protein is preferably 
located at or in the surface of the host. 

Four different types of cell wall binding domains are envisaged. The first 
may bind (e.g. covalently) to the peptidoglycan part of the cell wall. Suitable anchor 
may be from a proteinase, such as PrtP (Novarce, 1994). Secondly, the binding may 
be (e.g. covalently) to the lipid bilayer, for example through N-acyl diglyceride 
modification of a cysteine residue (for example located immediately C-tenninal to 
the signal sequence cleavage site (Pugsley, 1993). Thirdly, there may be a non- 
covalent linkage to the cytoplasmic membrane (for example, Lysteria monocytogenes 
actA , Cocks et al., 1992). A fourth possibility is an anionic interaction, such as non- 
covalent one, which may be to a cell wall component. This may thus be a different 
method from linkage to a peptidoglycan or membrane. 
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For the first type of linkage, the fusion protein may comprise a sorting signal 
(such as from a proteinase (PrtP) or L. casei). Preferably the bacteria heterologously 
expresses an S-layer protein encoding gene to produce the expressed protein. 

A fourth aspect of the invention relates to an L. casei bacterial cell expressing 
a (bacterial) surface layer protein that is not from L. crispatus and/or is not a collagen 
binding protein. The S-layer protein may be one that does not bind collagen, for 
example it is not a collagen binding S-layer protein. Preferably the S-layer protein is 
antigenic, or capable of inducing an immune response. Preferably, the L. casei cell 
expresses an S-layer protein that is naturally from, or originates from, 
L. acidophilus. However, the S-layer protein may be derived from a wild-type or 
naturally occurring L. acidophilus protein, and for example it may be an S-layer 
protein modified in accordance with the first aspect. The S-protein may be expressed 
on the surface of the cell. This may be achieved by providing the S-protein with a 
cell wall anchor that allows it to bind to the cell-surface of L. casei, for example an 
anchor region from an L casei surface layer protein. This may be from a proteinase 
(such as PrtP). 

A fifth aspect of the invention relates to a (modified) bacteria expressing 
only, or homogenously, a heterologous or modified bacterial surface layer (S-layer) 
protein. The protein may be heterologous in the sense that the bacteria does not 
normally or in unmodified foim does not express that protein. The heterologous or 
modified protein may therefore replace the natural or wild-type protein, preferably 
completely, or in its entirety. Thus, the bacteria may express only the heterologous 
or modified S-layer protein, but not any of the wild-type S-layer protein, at least in 
the S-layer. (Previous aspects included, where appropriate, cells that expressed both 
the wild-type S-protein as well as a modified S-protein). The bacteria may be 
another than from Caulobacter, 

The fifth aspect therefore includes bacteria having a genome which contains 
or incorporates a polynucleotide encoding a heterologous (to the bacteria) or non- 
wild-type or modified S-layer protein. That is to say, such a polynucleotide may be 
integrated into the genome. Preferably, the gene or polynucleotide encoding the 
wild-type or natural S-layer protein has been silenced, switched off, replaced or 
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otherwise caused to be non-expressed. The wild-type may thus be exchanged for a 
mutant (allele), in effect in vivo. 

Preferably the surface layer forms an array, and may cover the entire cell. It 
may thus form an outermost envelope. The surface layer may be a crystalline 
monolayer, or comprise (e.g. self) assembled S-layer proteins. These may be in the 
form of monomelic units. The S-layer may therefore comprise an anray of single 
protein monomers (the S -proteins). 

hi all aspects, suitably the S-layer protein, or bacterial cell, is from a lactic 
acid bacteria, or from the Bacilhis/Clostridium group of bacterial (e.g. low GC 3 Gram 
positive bacteria). Suitably the protein or cell will be from the family 
Lactobacillacede, such as genus Lactobacillus. 

Bacterial Cells of the Invention 

The cell may be a Gram positive bacteria, for example from the AT rich 
(rather than GC rich) group (also called the low GC class). Suitably the bacteria is 
from the Actinomycetes class (which includes Carnobacterium, Lactobacillus, 
Lactococcus, Leuconstoc, Pediococcus and Streptococcus. Preferably it is non- 
aquatic. It may be a Lactobacillus cell, or a lactic acid bacteria (LAB). It is 
preferably L. plantarum (such as L. plantarum SO or 256), L. casei, L. brevis, L. 
acidophilus, L. crispatus, L. helveticus, L. amylovoras, L. gallinarum or L. lactis. 
Preferably the host cell is one that can express the desired modified protein, which 
may be antigenic, for example using recombinant techniques. The host cell may 
express only the modified S-layer protein (and no wild-type S-layer protein, as in the 
third aspect above) or it may express both (the modified and unmodified) S-layer 
proteins. 

A Lactobacillus plantarum strain can readily be determined using known 
parameters (e.g. Bergeys Manual of Determinative Bacteriology and Vescovo et al, 
Ann. Microbiol. Enzymol. 43 261-2S4 (1993)). The skilled person can therefore 
readily determine whether a lactic acid bacterium is a Lactobacillus plantarum. 
Numerous Lactobacillus plantarum strains have been deposited at various institutes 
and are readily available. 
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The bacterial (host) cell should preferably have GRAS (generally regarded as 
safe) status, and more preferably be food grade. This may thus exclude soil bacteria, 
or Bacillus (e.g. food spoilage organisms, such as Bacillus terreus or sphaericus). 
Such organisms are not suitable for administration to humans (e.g. as a vaccine). 

Also, the bacterial host used should allow - upon transformation with an 
appropriate construct encoding an antigen- expression of the modified protein on the 
surface. 

Preferably the level of expression of the protein, e.g. intracellularly, secreted 
and/or exposed on the surface (such as determined by SDS-polyacrylamide gel 
electrophoresis or FACS), should be at least 0.1, 0.5, 1, 3 or 5, 7, 10 or 15% (e.g. 8, 9 
or 10 to 12, 13 or 14%) of the total cell protein. 

Also, the bacterial host is preferably capable of settling or persisting in and/or 
colonizing at least part of the gastrointestinal tract, such as the mouth, the throat, the 
larynx, the gut, the small intestine, the large intestine, the ileum and/or the colon, 
vagina, or a combination thereof. Preferably the bacterial host is such that it mainly 
settles in the intestine, more preferably in the small intestine or caecum. 

The bacteria preferably exhibits a persistence in the individual to be 
immunized (upon oral administration and as determined by the presence of the strain 
in the faeces) of at least 5 days, preferably at least 9 days, and suitably more than 1 5 
or even 20 days. Longer persistence may not be required if an administration 
regimen comprising the use of one or more booster immunisations as described 
below. 

The L, plantarum strain may be LMG 921 1 (NCHMB 8S26 as described by 
Mercenier above), DSM 4229 L. casei 393, and/or L. plantarum 80. Preferably 
however the bacterial host is one of the following L. plantarum strains: 256, LMG 
1284, LMG 6907, LMG 8155, LMG 9205, LMG 9206, LMG 9208, LMG 9209, 
LMG 9210, LMG 9212, LMG 1 1405, LMG 1 1460, LMG 8095, LMG 8027, LMG 
12167, LMG 13556, LMG 17552, LMG 1S021, LMG 18023, LMG 18024, LMG 
1S027, LMG 18095; 3S6, 299 , 105 or 275 (see Molin et al., 1993. J. Appl. 
Bacterid. 74:314), 299v (see WO 96/29083); So5,36 E , 95, 120 or 44 (see Johannson 
et al 1995, Int. J. Syst. Bacteriol, Vol 45(4):670-675), 79, 107, 98, 53, 97, 101 or 125 
(see Johansson et al 1995 Int. J. Food. Micro. 25:159), CH, ATCC 8041, ATCC 
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10012, ATCC 10776, WCFS, DF66 IHa, DF66spez. -IVa, and/or theZ. plantai-um 
strains available from tlie Japanese Collection of Micro-organisms under the 
accession numbers: 8341, S342, 8343, S344, S345, S346, 8347 and/or 8348. 

Preferably the (e.g. Lactobacillus) bacterium employed is foreign to the 
individual (human or animal) to which it is to be administered, for example 
vaccinated. The bacteria of the invention are modified or recombinant, and it is so 
the wild-type or naturally occurring bacteria, that is to say the non-recombinant or 
unmodified bacteria, that is suitably foreign. Thus by foreign it is intended to refer to 
a (e.g. Lactobacillus) strain not normally present in the individual. In other words, it 
is preferred to use an (e.g. Lactobacillus) strain that is non-human, for example it is 
not found or present in humans. Preferably the strain will be one not found in 
humans or animals, for example a strain that does not exist in the gut (G.L tract) 
and/or vagina. Such strains will provoke a greater or larger immune response. 
Although some prior workers have used human-derived L. plantarum strains (such as 
NCIMB SS26, a human saliva isolate) it is preferred to use a strain not present in or 
originating from humans. 

The present invention therefore preferably uses a bacterial strain that is not 
found in the mucosa (non-mucosal) or G.L tract of individual, for example it is not 
endogenous (to humans or a species of animal to be vaccinated). Indeed, the most 
preferred strains (e.g. L. plantarum 256) are found in silage. Such strains are clearly 
foreign to humans, and so provide an increased immune response. Nevertheless, 
preferably the strain may be capable of adhering to the intestinal mucosa. 

The strain may also be of non-foodstuff origin, for example not found in 
(human or animal) foods. This may thus exclude some L. casei strains, e.g. 393. 
Strain 393 is found in cheese, and so preferably strains found in dairy or fermentation 
products are excluded. This means that the individual has not (or may not) have had 
contact (e.g. ingested) the strain, and so using strains not found in foods is more 
likely to provoke an immune response. This may explain the long persistence times 
found with certain Lactobacilli. It may be that those bacteria are cleared more 
slowly as they are not recognised by the immune system. Suitably the strain is thus 
of animal origin, for example from an animal feedstuff (e.g. silage). Strains that are 
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generally more suitable are commensal (in the gut), rather than dietary (e.g. dairy 
origin). 

Preferably the bacterial (e.g. Lactobacillus) strain is viable (or alive) and 
intact. Suitably it will be able to persist (in the mucosa) of the individual for at least 
7 days. This can easily be tested using procedures known in the art (for example, 
testing for the existence of the organism in faeces). 

Thus the cell may be a non-human and/or non-human food Lactobacillus 
bacterium, such as L. plantarum, which has been modified to express the modified 
protein intracellularly or on the cell surface. This bacterium is preferably able to 
elicit an immune response in an individual, to whom the bacterium is administered. 
Thus the naturally occurring or unmodified Z,. plantarum is preferably foreign to that- 
individual, for example it is not endogenous to humans or the animal to which it is to 
be administered. Preferably the chosen L. plantarum strain will not be present in the 
G.L tract or mucosa of humans or that species of animal. 

Other suitable strains may be selected by the skilled person on the basis of 
one or more of the following properties or factors: 

- stability of the construct encoding the antigen in the bacterial host selected; level 
of expression of antigen in the bacterial host selected; regulation of expression of 
antigen in the bacterial host selected; site of expression of antigen in the bacterial 
host selected; and/or stability of antigen produced; 

- the biochemical properties of the strain used, for example its sugar fermentation 
profile (API), cell wall composition, structure of LTA, structure of 
peptidoglycan, 16S RNA sequence, acid resistance, bile acid resistance, 
agglutination properties, adjuvanticity, immune modulating properties, in vitro 
adherence properties, mannose-specific adherence, presence of proteinaceous 
adherence factors, presence of mapA-like adherence factors and/or presence of 
large proteinaceous adherence factors with repeated amino acid sequences; 
and/or 

- the interaction of the bacterial host with cells of the individual to which the host 
to be administered (i.e. as part of a vaccine according to the invention) including 
but not limited to its persistence , viability, in vivo expression of antigen and/or 
tissue-specific persistence. 
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In at least some, preferably most, and more preferably essentially all of these 
properties, the strain used should be essentially (at least) equivalent to the strains 
mentioned above, and more preferably equivalent to L. plantarwn 256, e.g. as 
determined on the basis of tests/assays for these properties known per se in the art. 
After a suitable host has been selected, it may be transformed with a genetic 
construct as described herein, after which its suitability as an oral vaccine may be 
tested. It is envisaged that on the basis of the description herein, and optionally after 
carrying out - for the purposes of confirmation- the tests described herein, the skilled 
person will be able to identify other L. plantarwn strains suitable for use as or in 
vaccines of the invention. 

Preferably the L. plantai~um is in the group (or cluster) 1, (which includes 
strains 101, 97, 53, 256, ATCC 14917, 36 E , 95, 98, 299, 299v, 107, 105, 79, 275, 
386, So5 and ATCC 8014), suitably subgroup (or subcluster) lb (which includes 
256, ATCC 14917, 36 E , 95 and 98), rather than subgroup (or subcluster) lc (which 
covers human originating strains). Particularly preferred is the L. plantarum strain 
256. 

To classify L. plantarum strains a method can be used in which the genomes 
of the Lactobacillus strains are characterized by performing a restriction 
endonuclease analysis (RE A) of chromosomal DNA followed by agarose gel 
electrophoretic separation of the fragments (Stahl et al 9 Int. J. Syst. Bacterid. 40: 
189-93 1990; Johansson et al, Int. J. Syst. Bacterid. 45, 670-675 1995). The power 
of the REA technique is that it provides a unique fingerprint of the chromosomal 
DNA of each strain. When REA was applied to classify L. plantaum strains and the 
results were analysed by cluster analysis it was shown that this species is comprised 
of two clusters. Cluster 1 comprises strains originating from both human intestine 
and fermented food and feed, whereas cluster 2 comprises only two strains from 
Nigerian ogi. Cluster 1 contains 3 subgroups, la, lb and lc. All human L. 
plantarum strains were found in group lb. L. plantarum strains of non-human origin 
were found in all 3 sub-groups. 

A further distinction between L. plantarum strains from the three subgroups 
of cluster I was described by Adlerberth et al (Appl. Environm. Microbiol. 62, 
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2244-2251 1996) who observed that strains from subgroup lc adhered considerably 
stronger to human adenocarcinoma cells in vitro compared to strains from subgroups 
la and lb. Strains from subgroups la, lb and lc can be further discriminated on the 
basis of their hemagglutination patterns. Strains from subgroup lc strongly 
agglutinated human, guinea pig, chicken, cat dog, mouse, rabbit, rat, horse and pig 
erythrocyte, while those from la and lb display strongly reduced or no agglutination 
characteristics. 

A combination of one or more of the strains mentioned above may also be 

used. 

Preferably the host cell has only a limited persistence in the gastrointestinal 
tract, such as from 3 to 10 days, preferably from 4 to 8 days, similarly from 5 to 7 
days. 



Brief description of the Figures 

Figure 1. (A) Linker insertion mutagenesis and E. coli expression vector 
constructions. Sites chosen for linker insertion are indicated by solid vertical arrows. 
For details on linker insertion positions the reader is referred to Table 1. (B) 
Lactobacillus vector constructions. Promoter sequences are indicated by an arrow 
pointing to the right and a stemloop indicates the presence of a transcription 
terminator sequence. Black barr indicates the slpA sequence encoding mature S A - 
protein and grey bars the signal sequence. Restriction sites are abbreviated as 
follows: B, BamHI; K, Kpnl; P, Pstl; Bs, BstEEL; H, HindUL, Nc, Ncol; N, NotI; X, 
XJrol S, Sail; Bg, BgXL; E, EcoSl. 



Figure 2. SDS-PAGE analysis of S A -protein mutants purified from E. coli. Total (T), 
pellet (P) and supernatant (S) fractions obtained by dialysis and centrifugation. The 
numbers above each panel represent mutant numbers. 

Figure 3. SDS-PAGE and western analysis of surface protein extracts obtained from 
wild type L. acidophilus (lane 1) and transformants producing mutant S A -proteins 
Sa9c> S A i!c 5 S A i2c and S A i3c (lanes 2 to 5). The top panel shows a coomassie stained 
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SDS-PAA gel, the middle panel a western analysis with anti-S A -protein and the 
lower panel a western analysis with anti-c-myc. The arrow indicates S A -protein (43 
kDa) and the Mr's of the reference proteins are given in the left margin. 

Figure 4. Detection of c-myc at the cell surface of L. acidophilus transformants using 
flow cytometry. Cells were coated with mouse monoclonal anti-c-myc antibodies 
followed by fluorescent labelling with goat-anti-mouse FITC-conjugated antibodies. 
Fluorograms of L, acidophilus producing S A9c , Sauc, S A i2c and S A i3c are indicated 
with 9c, 1 1c 12c and 13c, respectively. The inset shows the negative control (wild 
typeL. acidophilus). 

Figure 5. Western analysis of SAN and four N- or C-tenninally truncated SAN 
peptides. Peptides were separated by SDS-PAGE and detected with anti-his 
antibody. Oligomers of SAN (<8>) and dimers of SAN2, SAN4, SAN6 and SAN7 (p) 
peptides are indicated with arrows (lane 1 to 5, respectively). Mr's or reference 
proteins are indicated in the right margin. 

Figure 6. Hypothetical structural organisation of S A -protein monomer and the L. 
acidophilus S-layer. (A) Contact points of one repeating unit with surrounding units 
are indicated with arrows and the putative boundary between two morphological 
subdomains (black and black shading, respectively) with a dashed line. (B) Overview 
of repeating units forming the S-layer structure. Repeating units as well as 
subdomains are indicated as in (A). (C) Schematic representation of one S A -protein 
monomer showing the SAN and SAC domains and the two putative SAN 
subdomains. (D) Schematic representation of a cross-section of the L. acidophilus 
cell wall and S-layer. 

Brief description of the Sequences 

SEQ ID NO. 1 is a DNA sequence encoding the (prior art) L. acidophilus 
(mature) S-layer protein (sip A gene) which was modified to create the surface layer 
proteins of the invention; 
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SEQ ID NO. 2 is the amino acid sequence of SEQ ID NO. 1; 

SEQ ID NOS. 3 to 17 are amino acid sequences showing the position of (and 
including) the heterologous polypeptide (sequence) used to create the modified 
proteins of the invention; and 

SEQ ID NOS. IS to 33 are primers used to construct the proteins of the 
invention. 

Mode of vaccine delivery 

Preferably this is a mucosal or oral or nasal vaccine. This may contain one or 
more of the bacterial cells of any of the aspects of the invention previously described. 
The vaccine may comprise lactose, such as to assist in the expression of the modified 
protein. This can mean any vaccine suited, adapted, intended and/or formulated for 
mucosal delivery. 

Mucosal delivery means any route of administration to the body of a human 
or animal for which it is not required to penetrate or puncture the skin (e.g. as with 
intravenous, intramuscular, subcutaneous or intraperitoneal administration). Usually, 
this means that the vaccine is administered to the body via one of the body cavities, 
such that it comes into contact with the mucosa. Hence mucosal administration in 
particular refers to nasal, oral and/or vaginal administration. 

Oral delivery (of a vaccine) means any route of delivery to the body of a 
human or animal into, by which the vaccine can be presented to, the gastrointestinal 
(G.I.) tract or any part thereof. Usually, this will involve administration into or via 
the mouth into the G.I. tract This also includes administration directly into the G.L 
tract or into any part thereof, and including into the stomach, for instance using a 
tube or catheter. 

An oral vaccine is one that is suited, adapted, intended and/or formulated for 
oral delivery as defined above. 
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Immune Response 

Suitably the vaccine will elicit an (immune) response. A response (e.g. an 
antibody response or immune response) is deemed significant if it leads to a 
detectable change or response in a human or animal, and in particular to a detectable 
immunological change or response, such as the production of antibodies, cytokines, 
lymphokines, etc. Tests for determining whether a response is significant are known 
in the art and include, but are not limited to, titration of antibody levels in biological 
samples using ELIS A techniques, ELISPOT techniques and in vitro lymphocyte 
stimulation assays. Such techniques are usually carried out on a biological sample, 
such as a biological fluid or cell sample, obtained from the human or animal. 

A significant response may be, but is not necessarily, also a protective 
response as defined below. A response (e.g. an immunological response against a 
pathogen or an antigen) is deemed significant when it is capable of protecting the 
human or animal having the response against the pathogen and/or against a pathogen 
associated with the antigen. 

An antigen is deemed exposed on a bacterial host (also referred to as 
(surface) exposition of the antigen) when it is present, forms part of, is attached to, 
and/or is otherwise associated with or detectable on (e.g. using a suitable 
immunological detection technique such as FACS or immunofluorescent 
microscopy) the surface of the bacterial host (e.g. the bacterial cell wall or envelope) 

Preferably, exposed means that the bacterium, when presented to a cell of a 
human or animal that is capable of mediating an immune response (such as the cells 
of the G.I. tract mentioned below) for a sufficient time and in a sufficient amount, is 
capable of eliciting a sufficient immune response against the antigen. 

Vaccines 

A preferred embodiment of the invention is a vaccine wherein the bacteria 
comprises an expression vector, for example capable of expressing the modified 
protein (e.g. an antigen), suitably on the cell surface (or intracellularly). The 
modified protein may thus be expressed internally or externally. Preferably the 
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protein is exposed on the cell surface (e.g. under conditions present in the 
gastrointestinal tract). 

The bacteria may express a modified protein (e.g. the heterologous 
polypeptide) specific for mucosa colonising pathogens or pathogens entering the 
body via the mucosa, specifically via the oral route. The heterologous polypeptide 
(e.g. antigen) may thus be specific for a gastrointestinal tract colonising pathogen. A 
heterologous polypeptide specific for tetanus {Clostridium tetanus), such as TTFC, is 
a particularly suitable candidate. 

Preferably the vaccines are formulated such that a single dose is sufficient. 
However embodiments where multiple applications over a period of time, e.g. with a 
view to the persistence of the bacterial host in the G.I. tract, are also envisaged. The 
provision of booster vaccinations is also envisaged with the vaccine formulations 
according to the invention. 

A preferred administration regimen comprises one or more "initial" doses or 
administrations on any of days 1 to 4, followed by one or more booster 
administrations on any of days 14 to 21, and optionally one or more further booster 
administrations on any of days 28 to 25. A single initial administration, followed by 
a single booster administration, within this time period, will generally be sufficient. 

As high a degree of expression as possible without damaging the viability of 
the cell or the host to be vaccinated is envisaged. With higher expression, less 
frequent and lower doses may be required for immunisation purposes. Naturally the 
dosage regime will not only depend on amount of antigen but also on antigen type 
and the presence or absence of other immunogenicity stimulating factors in the 
vaccine. 

One can use homologous expression and/or secretion signals in the expression 
vector(s) present in the recombinant bacteria in the vaccine: this may give a high 
degree of expression. The expression regulating signals used in the constructs in the 
Examples, as well as other expression signals, are suitable. 
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Heterologous polypeptide 

Such a polypeptide can be' a binding or target protein, a linker, or an antigen 
or antibody or part thereof. It is preferably an antigen, and so may be able to elicit or 
stimulate an immune response. It can be any antigen against which an immune 
response, more specifically a significant immune response and/or a protective 
immune response, can be elicited in an animal (preferably a mammal such as a 
human). The polypeptide (or antigen) may be associated with a pathogen, disease 
state and/or disorder of the human or animal to which the vaccine is to be 
administered. Preferably the antigen will be able to interact with one or more (e.g. 
specific) receptors, for example present on lymphocytes or in antibodies released 
from them. 

The polypeptide can thus comprise an immunogen. Since the polypeptide can 
be one that can elicit an immune response, this may exclude for example a protein 
that is present already or originates from in the individual. Antigens are thus 
preferably foreign to that individual. 

Any antigen, antigenic component or epitope known per se that can be 
expressed in the host can be used as the polypeptide. Usually this will be a peptide, a 
protein, or an antigenic part or fragment thereof, such as an epitope. As such, it may 
either be a native antigenic peptide or protein (or part, fragment or epitope thereof) or 
an antigenic analog or mutant thereof, for instance obtained synthetically or using 
recombinant DNA techniques. 

Recombinant bacteria and/or bacterial strains, as well as vaccines based 
thereon, may be provided that can be used to illicit a significant immune response, 
and preferably a protective immune response, against various antigens. Suitable 
(heterologous) polypeptides thus include: 

- antigens or antigenic determinants from a virus, bacterium, fungus, yeast or 
parasite; 

- allergens; 

- viral and/or bacterial antigens including those from (e.g. the gpl60 envelope 
protein of) the HTV virus, such as HTV-1 or HIV-2, a surface glycoprotein (of a 
Leishmania parasite), Shiga-like toxin, Shigella lipopolysaccharide antigen, 
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Escherichia coli heat labile toxin B subunit or K88 or fimbrial antigen, a CFA 
antigen (of an enterotoxigenic Escherichia coli strain), anthrax toxin, pertussis 
toxin or toxin from Bordetella pertussis (e.g. P69), tetanus toxin (or a fragment 
thereof, e.g. TTFC); 
- antigens from such pathogens as herpes virus, rubella virus, influenza virus, 
mumps virus, measles virus, polio(myelitis) virus (type 1, 2 or 3), rotavirus, 
FMDV, respiratory syncytial virus, Campylobacter species. Chlamydial 
organisms, species of the genus Cryptosporidium, cytomegalovirus, human 
immunodeficiency virus, Actinomyces species, Entamoeba histolytica, 
arenaviruses, arboviruses, Clostridium botulinum, species of the genus Candida, 
Vibrio cholera, Oyptococcus neoformans y EHEC strains of E. coli 0157:H7, 
026:H11, OHl:H8 and O104:H21, ETEC strains ofE. coli, strains of E.coli 
shown to possess enteroinvasiveness (EIEC), EPEC strains of E.coli, EAggEC 
strains of E.coli., DAEC strains of E.coli, filoviridae, parvovirus, Filarioidea, 
Staphylococcus aureus, species of the genus Clostridium peifringens, C. tetani, 
V. cholera, N. meningitides, C. trachomatis, Helicobacter pylori, Caliciviruses, 
Giacardia lamblia, Neisseria gonorrhoeae, hantaviruses, hepatitis viruses types 
A, B, C, D, E, Legionellae strains, Mycobacterium leprae, Listeria 
monocytogenes, species of the genus Clostridium pei-fi'ingens, Borrelia 
burgdorferi, Pseudomonas pseudomallei, Epstein Barr virus. Onchocerca 
volvulus, Poxviruses, Bordetella pertussis, Yersinia pestis, Coxiella burnetti, 
rabies virus, Treponema pallidium, Mycobacterium tuberculosis, Salmonella 
typhi, eukaryotic parasite causing malaria, Pneumocystis pneumonia, as well as 
agents causing toxoplasmosis. 

Preferably the polypeptide does not comprise a pilus type epitope (e.g. a pilin 
or adhesintope, such as from P. aeruginosa). The polypeptide may be a combination 
of two or more of the above linked (e.g. covalently) or as a fusion protein. 

Preferably the allergen is a human allergen, or an allergen provoking an 
allergic reaction in the type or species of individual to whom the composition is to be 
administered. It may be a house or insect allergen, such as from dust mite, e.g. 
Derp 1. 
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Apart from native antigens and antigenic components (including antigenic 
parts, fragments or epitopes thereof), antigenic mutants or analogues thereof - 
obtained synthetically or via recombinant DNA techniques - may be used. Also, in 
the vaccines of the invention, a combination of two or more such antigens may be 
present or expressed. These antigens may be expressed by a single (type or strain) of 
bacterial host, or by several different (types or strains) of bacterial host. 

Of the above, antigens against and/or specific for rotavirus, respiratory 
syncytial virus, Mycobacterium tuberculosis, human immunodeficiency virus, E.coli, 
Vibrio cholera, streptococci and chlamydia are especially preferred for use as an 
antigen in the vaccines of the invention. 

The polypeptide is preferably such that, upon expression, it still allows - at 
least to some extent, which may be reduced compared to the native strain - the 
recombinant bacterial host to settle in and/or colonize (part of) the gastrointestinal 
tract upon administration, and to persist there. This may be for a time sufficient to 
provide a significant immune response against the antigen and/or the pathogen 
associated with it. 

Preferably the polypeptide is at least 5, 10, 15, 20, 30 or 40 amino acids in 
length. The polypeptide may be no more than 50, 70, 100 or 200 amino acids long. 

Fusion proteins 

The heterologous polypeptide need not be a single protein, indeed it may be a 
combination of two or more polypeptides. However, the polypeptides(and hence 
also the modified protein) is suitably heterologous to the cell in which the (S-layer) 
protein is expressed. The heterologous polypeptide may therefore be an antigen or 
toxin linked to a viral protein, for example a tetanus protein (or antigen) covalently 
linked or fused to a viral protein. Preferably the tetanus protein is TTFC. The viral 
protein is preferably from a rotavirus, such as of the group Reowridae. It may be 
from a rotavirus of group A, B or C. The viral protein may be a capsid protein, for 
example an inner capsid protein (such a VP6) or an outer capsid protein (such as a 
VP4 or VP7). The protein may be a viral capsid protein and/or a viral surface 
protein. It may be a glycoprotein. 



. WO 03/055906 PCT/EP02/14749 

27 

Preferably the viral protein is at least 20, 25, 30, 40 to 50 amino acids in 
length. Preferably it is not a transmembrane protein, or a protein from HIV (such a 
GP41). 

The modified protein may additionally possess an anchor, or an anchor or 
binding regional domain, in order to allow it to be attached or tethered to a cell 
surface. This may be necessary if the bacterial cell does not normally express 
S-layer proteins, for example if it does not possess an S-layer. The anchor may be 
attached to the C~terminus. For example, if one is to obtain expression hi L. casei or 
L. plantamm, which do not possess an S-layer, then one can employ a cell wall 
anchor or binding region or domain (such as described earlier for location at the 
surface of the host), and if necessary a sorting signal, for example from! casei, such 
as PrtP (Martinez et al, supra). 

Lactobacillus or LAB Expression systems 

Usually, the expression system will comprise a genetic construct comprising 
at least one nucleotide sequence encoding the (modified) polypeptide, or antigenic 
component), preferably operably linked to a promoter capable of directing expression 
of the sequence in the bacterial host. Suitably the polypeptide to be expressed can be 
encoded by a nucleic acid sequence that is adapted to the preferred codon usage of 
the bacterial host . The construct may further contain (all) other suitable element(s), 
including enhancers, transcription initiation sequences, signal sequences, reporter 
genes, transcription termination sequences, etc., operable in the selected bacterial 
host. 

The construct is preferably in a form suitable for transformation of the 
bacterial host and/or in a form that can be stably maintained in the bacterial host, 
such as a vector or plasmid. More preferably, a food grade construct is used. 

A particularly preferred construct according to the invention comprises the 
multi-copy expression vector described in WO-A-96/324S7 or WO-A-01/21200, in 
which the nucleotide sequence encoding the antigen has been incorporated. Such a 
construct is particularly suitable for expression of a desired protein or polypeptide in 
a lactic acid bacterium, in particular in a Lactobacillus, at a high level of expression, 
and also can be used advantageously to direct the expressed product to the surface of 



WO 03/055906 



PCT/EP02/14749 



28 

the bacterial cell. The constructs may be characterised in that the nucleic acid 
sequence encoding the antigen is preceded by a 5 ! non-translated nucleic acid 
sequence comprising at least the minimal sequence required for ribosome recognition 
and RNA stabilisation. This can be followed by a translation initiation codon which 
may be (immediately) followed by a fragment of at least 5 codons of the 5' terminal 
part of the translated nucleic acid sequence of a gene of a lactic acid bacterium or a 
structural or functional equivalent of the fragment. The fragment may also be 
controlled by the promoter. The contents of these two WO publications, including 
the differing embodiments disclosed therein, and all other documents mentioned in 
this specification, are incorporated herein by reference. 

The vaccines of the invention are preferably oral vaccines, that is to say they 
are adapted for oral administration. Such oral vaccine compositions will usually be 
alkaline, since usually alkali is required in order to neutralise acid in the stomach, 
and allow the bacteria (or at least most of them) to pass through the stomach into the 
intestine alive. It is preferred that most of the bacteria administered will survive the 
stomach, and pass into the intestine. Increased immune responses can be achievable 
when the bacteria are alive, that is to say viable, rather than dead. This is because 
they can continue to express the modified protein in vivo. 

If the heterologous polypeptide is inserted at a location where it is not 
exposed, or at least partly present in the S-layer ? then this may reduce or avoid 
unwanted degradation or proteolytic processing of the polypeptide (in the GI tract). 
Not all pharmaceutical formulations will be alkaline, and therefore those that are not 
alkaline (for example nasal formulations) may not be suitable for oral administration. 

The sequence encoding the modified protein and/or heterologous polypeptide 
(e.s. antigen) can be obtained from any natural source and/or can be prepared 
synthetically using well known DNA synthesis techniques. The sequence encoding 
the antigen can then (for instance) be incorporated in a suitable expression vector to 
provide a genetic construct of the invention, which is then used to transform the 
intended bacterial host strain. 

The recombinant bacterial host thus obtained can then be cultured, upon 
which the harvested cells can be used to formulate the vaccine, optionally after 
further purification and/or processing steps, such as freeze-drying to form a powder. 
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The techniques required to create the genetic constructs containing the 
antigen-encoding sequence and for transforming, culturing and harvesting the 
bacterial hosts are well known in the art. For instance they are described in standard 
handbooks, including Sambrook et aL and F. Ausubel et al 9 eds., "Current protocols 
in molecular biology", Green Publishing and Wiley Interscience, New York (1987), 

The vaccine comprising the bacterial host can be formulated in a known 
manner, such as for the formulation of vaccines and/or for the formulation of 
preparations of live bacteria for oral administration to an animal or human. Reference 
can be made to preparations for the administration of probiotics, e.g. for the 
treatment of gastrointestinal disorders. 

The vaccine according to the invention can be in a form suitable for oral 
administration, which may be a solid, serui-solid or liquid form, including but not 
limited to solutions and/or suspensions of the bacteria, which are usually preferred. 

The vaccine preparation may also be in the form of a powder, such as a freeze 
dried powder that can be reconstituted before use, e.g. using a suitable liquid. It may 
be in the form of a solid or liquid preparation that is (to be) mixed with solid, semi- 
solid or liquid food prior to administration. It may also be in the form of a fennented 
product. 

Besides the bacteria, the vaccine may contain one or more pharmaceutically 
acceptable carriers or excipients, such as water. The vaccine may also contain one or 
more adjuvants, including immune adjuvants, suitable for oral administration. These 
are compatible with the bacterial host and suitably do not interfere (too much) with 
its desired immunogenic properties. According to one embodiment, the adjuvant may 
be a lactic acid bacterium, such as the bacterial host strain itself, one of the other L. 
plantarum strains mentioned above, another Lactobacillus species, or even a 
Laciococcus, Bifidobacterium or Propionibacierium species suitable for oral 
administration to humans or animals. 

Also, the vaccine may contain one or more further therapeutic substances 
and/or one or more substances that can facilitate and/or enhance the colonization of 
(part of) the G.I. tract by the bacteria, and/or the growth of the bacteria in the G.I. 
tract. The preparation may also be in a form suitable for (direct) administration into 
the stomach or gut, for instance via a tube or catheter. Preferably upon oral 
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administration the bacterial host settles in, and may thereupon colonise, the 
gastrointestinal tract, or at least a part thereof, such as the mouth, the gut, the small 
intestine (e.g. duodenum, jejunum or ileum), the large intestine (or part thereof, such 
as the caecum) or colon, and preferably either the small intestine or the caecum. 

Immunogenicity 

The modified protein, and so, if desired, the heterologous polypeptide (e.g. 
antigen) expressed by the bacterial host thus can come into contact with the mucosal 
layer, the lining and/or the wall of the G.I. tract (collectively referred to hereinbelow 
as "wall of the G.I. tract"), and more specifically with cells within this wall. This can 
mediate an immune response against the antigen(s) thus presented, such as antigen- 
presenting cells (for example macrophages, dendritic cells and/or B-lymphocytes). 
This immunological response by the cells within the wall of the G.I. tract may by 
itself already constitute a significant immune response as defined herein, and/or it 
may trigger further immunological reactions/responses in the body of the human or 
animal to which the vaccine has been administered, which again may be a significant 
response and/or may be a protective response as defined herein. However, the 
invention is not limited to any specific mechanism via which the recombinant 
bacterial host elicits any immune response(s). For instance, the immune response 
elicited by an antigen as expressed by the bacterial host may provide a stronger or 
enhanced immune response compared to the response that would be elicited by the 
antigen as such, e.g. as a (free) soluble protein. 

The bacterial host - or any part or fragment thereof and/or any further 
compound(s) produced by it - may interact with the wall of the G.I. tract and/or with 
specific cells therein such as the cells which mediate and/or are involved in the 
immune response. This may enable, facilitate or enhance the immune response to the 
antigens associated with the bacterial host, such as compared to administration, 
expression and/or use of the antigen as a free, soluble protein. 

Although the (modified) protein is preferably expressed by the bacterial host 
such that the antigen becomes exposed on the cell surface, it is not excluded that - in 
order to illicit the immune response - the contents of the cells of the bacterial hosts 
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are in situ (i.e. locally at the wall of the G.L tract) released and/or liberated from the 
bacterial cell, e.g. by a mechanism which makes the walls of the bacterial cell wall 
permeable and/or destroys the cells or the bacterial cell wall. Thus it is possible that 
(in situ at the wall of the G.L tract) the immunogenic response may not be (just) 
caused/elicited by the intact bacterial host, but by a part, fragment, fraction or 
compound thereof for example the antigen as such and/or cell fragments or cell 
fractions comprising the antigen. Thus although it is preferred that the vaccines 
comprises intact, viable and/or live bacteria, vaccines that (e.g. also) contain 
fragments, fractions, lysates etc., of or derived from the recombinant bacterial host 
are not excluded. 

Dosages 

The amount of bacteria administered is not critical, but suitably it is sufficient 
for the bacteria to settle into and/or colonize (the desired part of) the G.L tract, and/or 
to cause a significant immune response. A suitable amount will be at least 10 s cfu, 
preferably 10 s - 10 10 cfu per dose. This may allow a sufficient amount of bacteria to 
pass into the intestine, if required. Oral administration of doses less than 10 s cfu may 
not always give the desired immunogenicity (at least not in a reliable manner), 
whereas amounts of more than 5xl0 10 cfu 5 if cumbersome to administer orally, are 
less preferred. The above amounts of bacteria (dosages) may correspond, to for 
instance, 10 6 to 10 8 cfu per kg of body weight of the human or animal. The 
concentration of bacteria in the vaccine (or other formulation) may be at least 
5xl0 9 /ml, such as at least 10 10 /ml. The formulation may be administered for only up 
to 2, 3 or 4 days. The bacteria may still be detectable in the individual at least 5 
days, 7 days or 9 days after the first or last administration. 

Vaccine formulations 

Preferably the individual to be vaccinated is a human or an animal. The 
human can be an infant, immunocompromised person, elderly person or a normally 
healthy infant, child or adult. 
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One advantage of the bacterial hosts used is that they can be capable of 
surviving in/passing through the gut in amounts sufficient to colonize the intestine(s). 
Nevertheless, one can adniinister the bacteria in or as a coated or encapsulated 
preparation, for instance in the form of a delayed release composition or an enteric 
coated composition. Suitable encapsulating compounds include but are not limited to 
chitosan, maltodextrin, lipids and oligo- and polysaccharides. Encapsulation may 
also improve the shelf-life of the vaccine. The vaccine may be adjuvant-free but 
preferably contains one or more adjuvants. 

Strains of other Lactobacillus species may also prove suitable for use as a 
bacterial host, for example in vaccines of the invention. These can include strains 
from L. pentosus, L, reuteri, L. animalis (= L. murinus), L. feimentum, L. 
acidophilus y L. anspatus, L. gasseri^ L. johnsonii, L. salivarius, L. bi'evis, L. 
rhamnosis, L. helveticus and/or L. paracasei. 

The strains useful in the invention preferably have GRAS status and more 
preferably are food-grade. Also, they are most preferably used in combination with 
the expression vectors of WO-96/32487 mentioned above, or another vector that 
gives a level of expression (e.g. intracellularly and/or exposed on the surface of the 
bacterial host) comparable to this preferred expression vector. 

Furthermore, although the use of strains belonging to the genus Lactobacillus 
are preferred, it is envisaged that suitable strains could possibly also be selected from 
bifidobacteria and the propriobacteria, e.g. from the genus Bifidobacterium and/or 
the genus Propriobacterium. Suitable strains can be selected by the skilled person in 
the same way as L.plantarum and/or Lactobacillus can be selected. 

For instance, a suitable test for detemiining/confiiming whether a selected 
strain is suitable as a bacterial host according to the invention, is to transform the 
host with a plasmid carrying the replication region of pAMBl, pNZ71, pLP323 under 
the control of promoter sequences such as the regulatable promoters Paniy and Pxyl 
or the constitutive promoters Pldh and PslpA. Examples of such vectors are the 
TTFC carrying vector pPG421 (for surface anchored/surface exposed expression of 
the TTFC-antigen under control of P amy) and/or the TTFC carrying vector pPG623 
(for intracellular expression of the TTFC-antigen under control of Pxyl), then to 
administer the recombinant host thus obtained orally to an animal, preferably a 
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mammal (e.g. a mouse such as a BALB/c and/or C57bl/6 mouse), preferably 
according to the single dose priming and boosting schedule as mentioned in the 
Examples, followed by measuring the end-point titres of IgG in individual sera by 
ELISA using tetanus toxoid. In such an assay, the selected recombinant host 
preferably provides higher (i.e. at least 1% higher) titres than L.plantarum NICMB 
8826 and/or L.plantarum 80 when transformed with the same vector and 
administered under the same conditions; and more preferably titres which are at least 
10% higher, even more preferably at least 20% higher. 

Where the law allows, the invention also relates to the administration of the 
bacterium or vaccine to an individual, such as a human or animal (e.g. a mammal), 
where that individual (or subject where appropriate) is in need of bacterium or 
vaccine. The individual may require treatment or prophylaxis or a particular disease* 
as described above. 

Further Variants and Homologu.es 



The (modified) proteins of the invention can if necessary be produced by 
synthetic means although usually they will be made recombinantly. They may be 
modified for example by the addition of histidine residues or a T7 tag to assist their 
identification or purification or by the addition of a signal sequence to promote their 
secretion from a cell. 

Modifications can be made to the original or wild-type sequence, and yet the 
protein may still be an S-layer protein. This may include the deletion, modification' 
or addition of single amino acids or groups of amino acids within the protein 
sequence, preferably so that the peptide maintains the basic biological functionality 
of the protein. 

Conservative substitutions may be made, for example according to the following 
Table. Amino acids in the same block in the second column and preferably in the 
same line in the third column may be substituted for each other. Preferably 
substitutions do not affect the folding or activity of the polypeptide. 
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ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar-uncharged 


CSTM 






NQ 




Polar-charged 


DE 






KR 


AROMATIC 




HFWY 



Alterations 

Proteins of the invention may be chemically altered, e.g. post-translationally. 
For example, they may be glycosylated (one or more times, by the same or different 
sugars) or comprise modified amino acid residues. They may also be modified by the 
addition of histidine residues (to assist their purification) or by the addition of a 
signal sequence (to promote insertion into the cell membrane). The protein may 
have one or more (N) amino- or (C) carboxyl-teiminal extensions, such as an amino- 
terminal methionine residue, a small linker peptide of up to about 20-25 residues, or 
a (small) extension that facilitates purification, such as a poly-histidine or T7 tag, an 
antigenic epitope or a (e.g. maltose) binding domain (e.g. at the C -terminus). These 
extensions may or may not be added via a linker. 

Pohnmcleotides. 

The present invention provides an (e.g. isolated and/or purified) polynucleotide 
encoding a (modified) protein of the invention. The present invention thus provides 
a polynucleotide encoding the modified protein. Also included is a polynucleotide 
selected from: 

(a) a polynucleotide encoding the modified protein or the complement thereof; 

(b) a polynucleotide comprising a nucleotide sequence capable of (e.g. 
selectively) hybridising to a nucleotide sequence set out in (a), or a fragment 
thereof; 
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(c) a polynucleotide comprising a nucleotide sequence capable of (e. g. 
selectively) hybridising to the complement of the nucleotide sequence set out in 
(a), or a fragment thereof; and/or 

(d) a polynucleotide comprising a polynucleotide sequence that is degenerate 
as a result of the genetic code to a polynucleotide defined in (a), (b) or (c). 
Polynucleotides in (b) and (c) can contain the change(s) corresponding to the 

modification(s) in the protein. 

Hybridisable sequences 

The term "capable of hybridizing" means that the target polynucleotide of the 
invention can hybridize to the nucleic acid used as a probe (for example the 
nucleotide sequence of the modified protein, or a fragment thereof or the 
complement thereof) at a level significantly above background. The nucleotide 
sequence may be RNA or DNA and thus includes genomic DNA, synthetic DNA or 
cDNA. Preferably the nucleotide sequence is a DNA sequence and most preferably, a 
cDNA sequence. Polynucleotides of the invention can be synthesized according to 
methods well known in the ait. 

A polynucleotide of the invention can hybridize to the coding sequence or the 
complement of the coding sequence of SEQ ID No.l at a level significantly above 
background. Background hybridization may occur, for example, because of other 
cDNAs present in a cDNA library. The signal level (eg. generated by the interaction 
between a polynucleotide of the invention and the coding sequence or complement of 
the coding sequence of SEQ ID No. 1) is typically at least 10 fold, preferably at least 
100 fold, as intense as interactions between other polynucleotides and the coding 
sequence of SEQ ID No. 1 . The intensity of interaction may be measured, for 
example, by radiolabelling the probe, e.g.^with 32 P. Selective hybridization may 
typically be achieved using conditions of low stringency (0.3M sodium chloride and 
0.03M sodium citrate at about 40°C), medium stringency (for example, 0.3M sodium 
chloride and 0.03M sodium citrate at about 50°C) or high stringency (for example, 
0.3M sodium chloride and 0.03M sodium citrate at about 60°C). Hybridization may 
be carried out under any suitable conditions known in the art 1 and, as a guide, low 
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stringency can be 2 x SSC at 55°C, medium stringency can be 0.5 to 1.0 x SSC at 
60°C and high stringency can be 0.1 or 0.2 x SSC at 60°C or higher (e.g. at 6S°C), all 
at 0.5% SDS. 

Alterations 

Polynucleotides of the invention may comprise DNA or RNA. They may be 
single or double stranded. They may also be polynucleotides which include within 
them one or more synthetic or modified nucleotides. A number of different types of 
modifications to polynucleotides are known in the art. These include 
methylphosphonate and phosphorothioate backbones and/or addition of acridine or 
polylysine chains at the 3' and/or 5' ends of the molecule. For the purposes of the 
present invention, it is to be understood that the polynucleotides described herein 
may be modified by any method available in the art. 

It is to be understood that skilled persons may, using routine techniques, make 
nucleotide substitutions that do not affect the protein sequence encoded by the 
polynucleotides of the invention to reflect the codon usage of any particular host 
organism, for example in which the polypeptides : of the invention are to be 
expressed. 

The wild-type protein sequence may be modified by nucleotide substitutions, for 
example from or up to 1, 2 or 3 to 10, 25, 50 or 100 substitutions. The 
polynucleotide may alternatively or additionally be modified by one or more 
insertions and/or deletions and/or by an extension at either or both ends. The 
modified polynucleotide generally encodes a protein which is antigenic or is an 
S-layer protein. Degenerate substitutions may be made and/or substitutions may be 
made which would result in a conservative amino acid substitution when the 
modified sequence is translated, for example as discussed with reference to proteins. 

Recombinant Aspects 

The invention also provides vectors comprising a polynucleotide of the 
invention, including cloning and expression vectors, and methods of growing, 
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transforming or transfecting such vectors in a suitable host cell, for example under 
conditions in which expression of a polypeptide of the invention occurs. Provided 
also are host cells comprising a polynucleotide or vector of the invention wherein the 
polynucleotide is heterologous to the genome of the host cell. The term 
"heterologous", usually with respect to the host cell, means that the polynucleotide 
does not naturally occur in the genome of the host cell or that the polypeptide is not 
naturally produced by that cell. 

Polynucleotides of the invention can be incorporated into a recombinant 
replicable vector, for example a cloning or expression vector. The vector may be 
used to replicate the nucleic acid in a compatible host cell. Thus in a further 
embodiment, the invention provides a method of making polynucleotides of the 
invention by introducing a polynucleotide of the invention into a replicable vector, 
introducing the vector into a compatible host cell, and growing the host cell under 
conditions which bring about replication of the vector. The vector may be recovered 
from the host cell. Suitable host cells are described below in connection with 
expression vectors. 

Vectors 

The polynucleotide of the invention may inserted into an expression cassette. 
The vector into which the expression cassette or polynucleotide of the invention is 
inserted may be any vector which may conveniently be subjected to recombinant 
DNA procedures, and the choice of the vector will often depend on the host cell into 
which it is to be introduced. Thus, the vector may be an autonomously replicating 
vector, i.e. a vector which exists as an extra-chromosomal entity, the replication of 
which is independent of chromosomal replication, e.g. aplasmid. Alternatively, the 
vector may be one which, when introduced into a host cell, is integrated into the host 
cell genome and replicated together with the chromosome(s) into which it has been 
integrated. 

Preferably, a polynucleotide of the invention in a vector is operably linked to a 
regulatory sequence which is capable of providing for the expression of the coding 
sequence by the host cell, i.e. the vector is an expression vector. The term "operably 
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linked" refers to a juxtaposition wherein the components described are in a 
relationship permitting them to function in their intended manner. A regulatory 
sequence such as a promoter, enhancer or other expression regulation signal 
"operably linked" to a coding sequence is positioned in such a way that expression of 
the coding sequence is achieved under condition compatible with the control 
sequences. 

The vector may be a plasmid, cosmid, virus or phage vector, usually provided 
with an origin of replication, optionally a promoter for the expression of the 
polynucleotide and optionally an enhancer and/or a regulator of the promoter. A 
terminator sequence may be present, as may be a polyadenylation sequence. The 
vector may contain one or more selectable marker genes, for example an ampicillin 
resistance gene (in the case of a bacterial plasmid) or a neomycin resistance gene (for 
a mammalian vector). Vectors may be used in vitro, for example for the production 
of RNA or used to transfect or transform a host cell. They may comprise two or 
more polynucleotides of the invention, for example for overexpression. 

The DNA sequence encoding the protein is preferably introduced into a suitable 
host as part of an expression cassette (or construct) in which the DNA sequence is 
operably linked to expression signals which are capable of directing expression of 
the DNA sequence in the host cells. For transformation of the suitable host with the 
expression construct transformation procedures are available which are well known 
to the skilled person. The expression construct can be used for transformation of the 
host as part of a vector carrying a selectable marker, or the expression construct may 
be co-transformed as a separate molecule together with the vector carrying a 
selectable marker. The vector may comprise one or more selectable marker genes. 

A vector or expression construct for a given host cell may comprise the 
following elements operably linked to each other in a consecutive order from the 5'- 
end to 3'-end relative to the coding strand of the sequence encoding the polypeptide 
of the first aspect: 

(1) a promoter sequence capable of directing transcription of the DNA 
sequence encoding the polypeptide in the given host ceil; 

(2) optionally, a signal sequence capable of directing secretion of the 
polypeptide from the given host cell into a culture medium; 
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(3) a DNA sequence encoding a mature and preferably active foim of the 
polypeptide; 

(4) preferably, a transcription termination region (terminator) capable of 
terminating transcription downstream of the DNA sequence encoding the 
polypeptide; and 

(5) optionally, a repressor (e.g. XylR). 

Downstream of the DNA sequence encoding the polypeptide there may be a 3 ! 
untranslated region containing one or more transcription termination sites (e.g. a 
terminator). The origin of the terminator is less critical. The terminator can e.g. be 
native to the DNA sequence encoding the polypeptide. The terminator may be 
endogenous to the host cell (in which the DNA sequence encoding the polypeptide is 
to be expressed). 

Enhanced expression of the polynucleotide encoding the protein of the invention 
may also be achieved by the selection of heterologous regulatory regions, e.g. 
promoter, secretion leader and/or terminator regions, which may serve to increase 
expression and, if desired, secretion levels of the protein of interest from the 
expression host and/or to provide for the inducible control of the expression of the 
polypeptide of the invention. Preferably at least one, and advantageously all, of 
these regulatory regions or elements originate from, or are compatible with, 
Lactobacillus bacteria. This may render the vectors and/or host cells food grade (or 
GRAS), except for any antigen encoding sequences. 

Aside from the promoter native to the gene encoding the protein of the 
invention, other promoters may be used to direct expression of the polypeptide of the 
invention. The promoter may be selected for its efficiency in directing the 
expression of the polypeptide of the invention in the desired expression host. 

Promoters/enhancers and other expression regulation signals may be selected to 
be compatible with the host cell for which the expression vector is designed. For 
example prokaryotic promoters may be used, in particular those suitable for use in 
Lactobacillus or E.coli strains. 

Examples of bacterial promoters are the xylose, lactate dehydrogenase, 
a- amylase and SPo2 promoters as well as promoters from extracellular protease 
genes. 
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Host cells and Expression 

In a further aspect the invention provides a process for preparing a polypeptide 
according to the invention which comprises cultivating a host cell (e.g. transformed 
or transfected with an expression vector as described above) under conditions to 
provide for expression (by the vector) of a coding sequence encoding the protein, and 
optionally recovering the expressed protein. Polynucleotides of the invention can be 
incorporated into a recombinant replicable vector, e.g. an expression vector. The 
vector may be used to replicate the nucleic acid in a compatible host cell. Thus in a 
further embodiment, the invention provides a method of making a polynucleotide of 
the invention by introducing a polynucleotide of the invention into a replicable 
vector, introducing the vector into a compatible host cell, and growing the host cell 
under conditions which bring about the replication of the vector. The vector may be 
recovered from the host cell. Suitable host cells include bacteria such as 
Lactobacillus or lactic acid bacteria, E. coli, yeast (e.g. Kluyveromyces), mammalian 
cell lines and other eukaryotic cell lines, for example insect cells such as Sf9 cells 
and (e.g. filamentous) fungal cells. 

Preferably the polypeptide is produced as a secreted protein in which case the 
DNA sequence encoding a mature form of the polypeptide in the expression 
construct is operably linked to a DNA sequence encoding a signal sequence. 
Preferably the signal sequence is native (homologous) to the DNA sequence 
encoding the protein. Alternatively the signal sequence is foreign (heterologous) to 
the DNA sequence encoding the protein, in which case the signal sequence is 
preferably endogenous to the host cell in which the DNA sequence is expressed. 

The vectors may be transconjugated, transformed or transfected into a suitable 
host cell as described above to provide for expression. This process may comprise 
culturing a host cell transformed with an expression vector as described above under 
conditions to provide for expression by the vector of a coding sequence encoding the 
protein. 

A further aspect of the invention thus provides host cells transconjugated, 
transformed or transfected with or comprising a polynucleotide or vector of the 
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invention. Preferably the polynucleotide is carried in a vector for the replication and 
expression of the polynucleotide. The cells wall be chosen to be compatible with the 
said vector and may for example be prokaryotic (for example bacterial), fungal, yeast 
or plant cells. 

The invention encompasses processes for the production of the protein of the 
invention by means of recombinant expression of a DNA sequence encoding the 
protein . For this purpose the DNA sequence of the invention can be used for gene 
amplification and/or exchange of expression signals, such as promoters, secretion 
signal sequences, in order to allow economic production of the protein in a suitable 
homologous or heterologous host cell. A homologous host cell is herein defined as a 
host cell which is of the . same species or which is a variant within the same species as 
the species from which the DNA sequence is derived. 

Preferred features and characteristics of one aspect of the invention are 
equally applicable to another aspect mutatis mutandis. 

The invention will now be illustrated by means of the following Examples, 
which refer to the figures, which are provided merely for the purposes of illustration, 
and the invention is not to be construed as being limited to the subsequent Examples. 

EXAMPLES 

Summary? 

The structure of the crystallisation domain, SAN, of the S A -protein ofZ,. 
acidophilus ATCC 4356 was analysed by insertion and deletion mutagenesis, and by 
proteolytic treatment. Mutant S A -protein synthesised hxE. coli with 7-13 amino acid 
insertions near the N-terminus or within regions of sequence variation in SAN 
(amino acid position 7. 45. 114, 125, 193) could form crystalline sheets, whereas 
insertions in conserved regions or in regions with predicted secondary structure 
elements (positions 30, 66, 88 and 156) destroyed this capacity. An insertion in the 
cell wall binding domain (position 345) did not affect crystallisation. FACscan 
analysis of L. acidophilus synthesising three crystallising and one non-crystallising 
S A -protein c-myc (19 amino acids) insertion mutant was performed with c-myc 
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antibodies. Fluorescence was most pronounced for insertions at positions 125 and 
156, less for position 45 and severely reduced for position 7. Immunofluorescence 
microscopy revealed a fluorescent ring in a fraction of the bacterial population, 
suggesting that these bacteria synthesised mutant S A -protein only. The finding that 
the chromosomal sip A gene was replaced by the mutant allele in a subset of the 
population corroborates this conclusion. Proteolytic treatment of SA-protein indicated 
that only sites near the middle of SAN are susceptible, although potential cleavage 
sites are present through the entire molecule. Expression in E. coli of DNA 
sequences encoding the two halves of SAN yielded peptides that could dimerise. The 
results indicate that SAN consists of a —1 2 kDa N- and a -1 8 kDa C-terminal 
subdomain linked by a surface exposed loop. The capacity of SA-protein of L. 
acidophilus to present epitopes, up to — 19 amino acids in length, at the bacterial 
surface and the exchange of the wild type by a mutant allele in vivo, makes the 
system suitable for application as an oral delivery vehicle. 

Introduction 

Surface layers or S-layers have been found in up to 400 different species of 
Eu- and Archaebacteria. They consist of one species of (glyco-)protein, the S- 
protein, which assembles into characteristic two-dimensional crystalline layers at the 
cell surface. This assembly is an entropy-driven process during which individual S- 
protein monomers form multiple interactions with each other and with the under- 
lying cell envelope (Beveridge, 1994; Sleytr & Messner, 1983). Why certain bacteria 
possess an S-layer is not always known but they have been shown to function as 
molecular sieve, scaffold for extracellular enzymes, protective coat or virulence 
factor (Egelseer et aL, 1995; Noonan & Trust, 1997; Sara & Sleytr, 1987). 

S-protein structure-function relationships remain poorly understood due to 
the lack of suitable methods to determine S-protein structure at the atomic level. 
Certainly the best understood aspect of S-protein structure is the association of S- 
proteins with the cell wall. Several cell wall associating domains have been described 
for S-proteins from Bacillus, Corynebacterium, Lactobacillus and Tliemxobacterium. 
Recent studies have shown that many S-proteins contain one or more copies of the 
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so-called Surface Layer Homology or SLH domain for cell surface attachment 
(Egelseer et al, 1998; Lemaire et al, 1998; Mesnage et al, 1999; Olabarria et al., 
1996). In contrast, much less is known about the role of structural components or 
domains in S-protein crystal formation but a few studies have shown that distinct 
crystallisation domains exist in some S-proteins, possibly corresponding to the 
morphological domains observed in EM studies (Baumeister et al. 9 19S9; Engelhardt 
& Peters, 1998; Jarosch et al, 2001; Mesnage et al, 2000; Smit et al., 2001). 

Several species of the genus Lactobacillus possess an S-layer. The S-layers of 
L. acidophilus and related species are composed of a single S-protein (S A -protein ) of 
around 45 kDa (Boot et al, 1996). The S A -protein of L. acidophilus ATCC 4356 and 
CbsA of L. crispatus JCM 5810 are the best studied with respect to structure and 
function (Boot et al., 1993; Sillanpaa et al., 2000; Smit et al., 2001). The function of 
the S-layer of these organisms is unknown but lactobacillar S-layers may be 
important for bacterial adhesion to intestinal epithelial cells and extracellular matrix 
(ECM) components (Schneitz et al., 1993; Toba et al., 1995). 

S A -protein of L. acidophilus shows considerable similarity to the putative 
product of slpB encoded by a silent S-protein gene of L. acidophilus, to S-proteins 
foml. helveticus andZ. crispati4S, and to a haem-agglutinating protein (HAP50) 
from L. acidophilus, but not to other S-proteins (Boot et al., 1996; Boot et al., 1995). 
Analysis of the primary amino acid sequences of these proteins showed that 
homology between these proteins is highest in the C-tenninal one third of the 
proteins (77-99% identity) and lower in the N-terminal two third (30-72% identity). 
While the N-terminal region shows a preponderance of hydrophobic amino acids, the 
C-tenninal region is mainly composed of hydrophilic residues, a large fraction of 
which consists of basic amino acids, rendering this region highly positively charged. 

It has been demonstrated that the N-terminal and C-terminal parts of SA- 
protein constitute different structural and functional domains. The N-terminal part of 
S A -protein (amino acids 1-290) constitutes the crystallisation domain, SAN. and is 
able to form S-layer crystals with lattice parameters similar to those of crystals 
formed by intact S A -protein. The C-terminal part of S A -protein (amino acids 291- 
412), SAC, serves to attach the S-layer to the cell wall (Smit et al., 2001). 
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S A -protein is among the smallest S-proteins known, making S A -protein an 
ideal candidate for the study of structure-function relationship of S-proteins. 
Insertion mutagenesis is a method widely used to determine structure-function 
relationships of proteins (Bingle et aL, 1997; Norton et aL, 1998; van Geest & 
Lolkema, 2000; Wong & Hancock, 2000). In the present invention insertion 
mutagenesis and proteolytic treatment was employed to gain further insight into the 
structural organisation of the L. acidophilus S-layer protein and its crystallisation 
domain SAN in particular. The results suggest a structural model for S A -protein in 
which SAN is comprised of two subdomains of around 15 kDa each flanking a loop 
region that is exposed to the environment. DNA sequences encoding the two 
hypothetical subdomains were expressed in E. coli and the properties of the resulting 
polypeptides were determined. 

This shows that functional SA-protein mutants can be used for the efficient 
surface exposure of a selected epitope making the system suitable for application as 
an oral vaccination vehicle. 

Results 

Insertion of peptide linkers in the SA-protein of L. acidophilus 

The linker insertion mutagenesis strategy yielded ten different slpA insertion 
mutants slpAll, slpAll, slpA\3, slpAU, slpAlS, slpA\6, slpAlS, slpA\9, slpA20 and 
slpAll, possessing linker insertions (NcNoX; 7 or 8 amino acids) after amino acid 
45, 125, 156, 177, 30, 66, 88, 1 14, 193 and 349, respectively, of the mature SA- 
protein. One additional mutant, slpA9c, contained a c-myc epitope insertion at amino 
acid 7 instead of the NcNoX linker insertion. The mutant genes and unmodified slpA 
(slpAlO), were transferred to expression vector pQE30AXN and the corresponding 
proteins Sha9c, Shah, Shai2, Shai3» Sham, Shais, Shai6 ; Shais 5 Shai9, Sha20, Sha21 and 
Shaio were synthesised in E. coli and purified by metal-affinity chromatography 
(Table 1). 



Table 1. 
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Mutant 


oite/type oi 
modification 


Position 
(AA no) 1 


AA composition' 


oize 
(AA) 


Crystal 
formation 


10 


N-terminal hi stag 3 


0 


-RGSHfiGSGT - ATTIN 


13 


+-H- 


11 


ifr/EII/linker-peptide 


45 


AIAGN - AM AAARGN - LTGTI 


8 


4-H- 


12 


Age\f\ mker-p ep ti d e 


125 


VKFTG - AMAAARAG - TNSDN 


S 


+++ 


13 


Mun I/linker-pep tide 


156 


TNVSI - TMAAARGI - ANVYA 


8 




14 


Spe\A inker-pep tide 4 


177 


YD VTS - AMAAARAS — GATVT 


8 




15 


C/fll/linker-peptide 


349 


NAANI - VAMAAARG- DGTICR 


8 


+++ 


16 


Psrl/hnker-peptide 


30 


PS VS A - SMAAARGA- VAANT 


8 




18 


,4/ivNI/Imker-peptide 


67 


LKADT - MAAARNT - EN ATI 


7 




19 


Pstl /] inker-pep tide 


SS 


A T~T A A CXjTA A AT» A \ / A X rT" 

AELAA - SMAAARGA — GVAYT 


8 




20 


Sryl/linker-peptide 


114 


KTVTL - A M A A A R G L- GSANS 


8 


+++ 


21 


Srvl/linker-peptide 


193 


NADNQ -AMAAARGQ- VNVAN 


8 


+ 


9c 


c-myc 


7 


TTINA - EQKLISEEDLNLE - SSSAI 


13 




11c 


c-myc 


45 


-AMVNEQKLISEEDLNARGN- 


19 


-H-r 


12c 


c-myc 


125 


•AMVNEQKL1SEEDLNARAG* 


19 




13c 


c-myc 


156 


-TMVNEQKLISEEDLNARGI- 


19 





1 : Amino acid (AA) positions relative to the start of the mature S A -protein 
2: Flanking sequences are underlined: these are SEQ ID NOS. 3 to 17 
3: This His-tag is present at the N-terminus of all mutants listed 

4: Insertion of 23 bp instead of 24 bp resulted in a frameshifi mutation and truncation of the protein 



All mutant genes were expressed at high levels in E. coli. SDS-PAGE 
analysis revealed that the gene products had the expected molecular mass (Mr) of 
about 46 kDa, slightly larger than wild type S A -protein (results not shown). For 
mutant Sham a truncated peptide of about 25 kDa was observed, the result of a 
frame-shift mutation already observed during sequencing. Mutant Sha9c showed a Mr 
of around 47 kDa due to a larger insert, the c-myc epitope. 

Functional analysis of SA-protein mutants synthesised in E. coli 

Centrifugation of dialysed mutant S A -proteins allowed discrimination 
between crystallised and soluble S A -protein. Mutant proteins Sha9c Shah, Shah, 
Shais, Shai6 and Sha20 and the wild type protein Sraio formed a precipitate, 
indicative of S A -protein crystallisation (Smit et aL, 2001), mutant Shaii formed less 
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precipitate than the previous mutants, while mutants Shab, Srais, Shai9 and the 
truncated Sham did not show any precipitate formation. SDS-PAGE analysis of the 
soluble and insoluble fractions confirmed that a substantial part of S-protein was 
present in the pellet fractions of Sha9c, Shaio, Shaiu Shai2> Srais, Shai6> and Shaio- 
For all other mutants, S A -protein was found only in the supernatant fraction (Figure 
2). Electron microscopical analysis of the precipitates formed by wild type (His- 
tagged) S A -protein (Shaio) and the precipitating mutants showed crystalline sheets 
possessing oblique symmetry, similar to that of S A -protein (data not shown). 

Three insertion mutants, slpAl 1, slpAll, and slpAU, were selected for 
replacement of the NcNoX linker by a c-myc epitope linker. This yielded mutant 
proteins Shaiic Shai2c and Shaisc with an insert of 19 amino acids and an Mi* of 
around 47 kDa. Introduction of the larger insert did not alter the properties of the 
mutant proteins. Shaiic and Shai2c could still form S-layer crystals whereas mutant 
Shai3c was unable to form crystals as determined by EM (data not shown). 

Consti'uction of expression vectors for Lactobacillus and synthesis ofS A -protein with 
a c-myc insertion in L. casei and L. acidophilic 

To determine whether the c-myc epitope in mutants Sha9c Shaiic, Shai2c and 
Shahc is buried in the S A -protein interior or is exposed on the S-layer surface, mutant 
S A -proteins were co-expressed with wild type S A -protein inZ. acidophilus. For the 
expression of mutant S-protein genes in Lactobacillus new vectors were constructed 
in which the sIpA9c, slpAUc, slpAllz and slpAUc genes were combined with the 
original sip A expression signals (the 5' leader sequence for mRNA stabilisation, 
ribosome binding site (RBS), start codon and secretion signal) (Figure 1). The 
complete sip cassettes were then transferred to the E. coli-Lactobacillus shuttle 
vector pLP401-T and introduced into L. casei ATCC 393. 

Analysis of mutant sip gene expression in L. casei revealed that all 
transformants produced mutant S A -protein irrespective of whether the idh terminator 
sequence between the promoter and mutant sip gene was present or not (results not 
shown). Mutant S A -protein production under inducing conditions from vectors 
without the terminator was only marginally increased compared to that from vectors 
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with the transcription terminator. This constitutive production of mutant S A -protein 
in L. casei under non-inducing conditions from vectors with the Idh transcription 
terminator, suggests the presence of a previously undetected promoter sequence in 
the 5' region of the sip fragments. In further experiments we used this promoter 
activity for constitutive mutant S A -protein production in L. casei and L. acidophilus. 
After confirmation of the synthesis and secretion by L. casei of the mutant S A - 
proteins, of their correct relative molecular mass and the presence of the c-myc 
epitope by SDS-PAGE and western analysis (data not shown), CsCl purified vector 
DNA (without Idh terminator) obtained from L. casei was introduced in L. 
acidophilus ATCC 4356. 

L. acidophilus transformants produced less mutant S A -protein compared to L. 
casei. From Z. acidophilus transformants producing S A9c , S A iic, S A i 2c and S A13c 
mutant protein as well as the wild type strain S A -protein could be completely 
removed by 5 M LiCl extraction, indicating that the bacteria contain a mosaic S-layer 
comprised of wild type and mutant protein. Mutant proteins were not detected in the 
culture medium. Using SDS-PAGE and western analysis, . an additional band, with a 
slightly lower mobility than the wild type S A -protein band, was observed in the 5 M 
LiCl extract of the four L. acidophilus transformants. Western analysis also showed 
that this band reacted with auti-c-niyc antibody while wild type S A -protein did not, 
confirming the presence of the c-myc epitope (results not shown). It was also 
observed that mutants S A9c and S Anc were produced at equal levels, while the S A i 2c 
production level was lower and that of S A i 3c even more so (Figure 3). 

SDS-PAGE analysis of soluble and precipitated S A -protein fractions obtained 
by dialysis and centrifugation of the 5 M LiCl extracts revealed that all four S A - 
protein mutants associated with both pellet and supernatant fractions, a behaviour 
typically observed for fully functional wild type S A -protein. No preferential 
association was observed of the non-cry stallising mutant S A i 3c with the dialysate 
supernatant fraction although this was observed for the protein purified from E. coli 
(data not shown). 



Exposition of th e c-myc epitope at the cell surface ofL. acidophilus 

Surface presentation of the c-myc epitope was analysed by flow cytometry 
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and immunofluorescence microscopy. FACScan analysis with c-myc antibodies 
yielded a signal for each of the L. acidophilus transfoimants although important 
differences were observed. The detected fluorescent signal increased in the following 
order: S A 9c, Saiic, S A i3c S A i2c (Figure 4). Interestingly, the cytometry signals were not 
proportional to the mutant protein production levels, which were lowest for the 
mutants with the highest fluorescent signals (S A i2c and S A i 3c ). 

The flow cytometry experiments also showed, especially for mutants S A i2 C 
and S A i3c, a large heterogeneity in fluorescence intensity. Using immunofluorescence 
microscopy two types of fluorescent cells were observed, one with diffuse 
fluorescent spots randomly distributed on the cells and another with an intense 
fluorescent ring completely surrounding the cell (data not shown). 

Detection of recombination at the chromosomal slpA locus 

The S-layer of cells possessing a surrounding fluorescent ring possibly 
consists solely of mutant S A -protein since a similar intense ring was observed using 
wild type cells and anti-S A -protein serum (Boot, 1996). To determine whether the 
chromosomal copy of the sip A gene in these cells has been inactivated or replaced by 
the mutant one carried by the expression vector a PCR strategy was applied. Using 
genomic DNA preparations isolated from L. acidophilus transfoimants harbouring 
pLPs/pA9c, pLPs/pAl lc, pLPslpAllc, and pUPslpA13c we were able to amplify 
recombination-specific fragments for all four transformants, indicating that the 
chromosomal slpA gene had been replaced by the mutant allele. In a control 
experiment using wild type L. acidophilus genomic DNA mixed with purified 
pLPslpAllc as template we did not find any PCR product (data not shown). 

Proteolytic treatment of SA-protein 

Limited digestion of S A -protein with various proteolytic enzymes yielded one 
major product of 36 kDa representing SAN, and several smaller fragments. The 
N-terminal sequence of these fragments has been determined: the fragments obtained 
after trypsin digestion had the following N-termini ATTESF and VKLDQ (amino 
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acids 1-5 and 140-144), and those obtained after chymotrypsin digestion ATTIN and 
AINTT (1-5 and 160-164), showing that these peptides represented the N-terminal 
and C-terminal regions of SAN. Apparently only sites near the middle of SAN are 
susceptible to proteolytic cleavage although a number of other potential trypsin and 
chymotrypsin cleavage sites are present in the two peptides. 

Properties of truncated SAN fragments 

To get further insight in the structure of SAN, vectors were constructed 
encoding C-terminally (SAN2, 4 and 6 carrying the first 159, 149 and 1 13 amino 
acids) or N-tenninally truncated peptides (SAN7, 3 and 5 comprising the last 177, 
151 and 141 amino acids). The truncated His-tagged genes were expressed in E. coli 
and the peptides SAN2, SAN4, SAN6 and SAN7 were purified by affinity 
chromatography. The peptides SAN3 and SANS were produced at very low levels 
and could not be purified probably due to increased sensitivity to E. coli proteases. 
SDS-PAGE and western analysis indicated that the truncated proteins had the 
expected size. Western blotting also showed the presence of molecules with twice the 
size of the monomers for the peptides, SAN2, SAN4, SAN6 and SAN7 (Figure 5). 
The stability of the peptides against proteolytic attack and their capacity to dimerise 
suggests that they constitute functionally active units. 

r 

Discussion 

Traditionally, S-layers have been studied by electron microscopical 
techniques and as a result much is known about their ultrastructure. Little is known, 
however, about S-protein structure-function relationships. In recent years, with the 
advent of versatile recombinant DNA technologies, valuable new clues to the 
structural organisation of S-proteins have been obtained. Methods like deletion 
analysis, sub-cloning of domains, linker mutagenesis and "cysteine-scanning" 
mutagenesis have been used for this purpose (Jarosch et al, 2001 ; Mesnage et a/., 
1999; Smit et a/., 2001). 
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, In the invention linker insertion and deletion mutagenesis were applied as 
well as S-protein proteolysis to clarify the structural organisation of the 
crystallisation domain, SAN, of the L. acidophilus S A -protein. The analysis 
comprised insertion of seven to thirteen amino-acid residues at eleven positions 
randomly distributed throughout the S A -protein ? ten of which were located in SAN. 
All mutant proteins contained an additional N-terminal His-tag (13 amino acids), 
which did not interfere with S A -protein crystallisation as was previously confirmed 
by electron microscopy. 

Introduction of insertions at positions 30, 66, 88, and 156 of S A -protein 
(Shai3, Srai6 5 Shais and Shaip) resulted in complete abolishment of the capacity to 
form crystals in vitro. Mutants with insertions after amino acids 7, 45, 125, 1 14 
(Sha9c> Shah, Shai2, and Sha2o) formed crystals similar to those formed by the His- 
tagged wild type protein, while an insertion at position 193 (Sha2i) resulted in a 
partially functional mutant S A -protein. An insertion in the cell wall binding domain 
(Shais) had no effect on assembly, as expected. 

For the interpretation of these results multiple S-protein sequence alignments 
and predicted S-protein sequence motifs (Sillanpaa et al. 9 2000; Smit et aL, 2001) 
were used. The data obtained from these analyses showed that the SAN domain of 
S A -protein consists of regions with high variability in composition and length (gap 
regions) alternating with regions that show considerably less variation. It was 
proposed that the regions of higher conservation are important for domain structure- 
function (responsible either for intra- or intermolecular interactions) (Sillanpaa et aL 9 
2000; Smit et aL, 2001). Disruption of these regions may result in loss of domain 
function, while regions of variable length are more likely to represent protein surface 
regions or loops (Miyazawa & Jemigan, 2000) and may accept the insertion of 
additional amino acid residues. 

In general, insertions in regions that were well aligned in S A -protein 
alignments or contained predicted secondary structure elements were not allowed. 
This in contrast to insertions in variable, non-conserved regions without any 
predicted secondary structure elements, which were accepted without disrupting 
S-layer formation. Two mutated regions are of importance. First the N-terminal 
region of the S A -protein. The N-terminus itself is flexible and does not directly play a 
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role in S A -protein crystallisation since it accepts an extension of 13 amin o acids 
without loss of domain function (Smit et a/., 2001). Insertion of additional amino 
acids at position 7 (ShasO was probably accepted because of the proximity to the SA- 
protein N-terminus. If the insertion was made further away from the N-terminus 
(amino acid 30; Shaio) then protein function was compromised, possibly due to 
improper folding of the region or to interruption of protein-protein interactions 
between S A -protein monomers. Despite the lack of significant conservation between 
amino acids 30 and 47, this region seems to be important for S A -protein 
crystallisation. 

The second region in which mutations 12 and 20 were made (positions 125 
and 114, respectively), is interesting because of its location in the centre of the SAN 
domain where insertions of up to 19 amino acids did not affect S A -protein 
crystallisation. This central region is the largest variable region observed in 
alignments and is the only region in SAN containing sites that are accessible to 
proteolytic enzymes. 

Finally, flow cytometric and immunofluorescence analysis of mutant S A - 
protein-producing L. acidophilus showed that the regions near amino acids 125 and 
156 (S A i2c and S A i3 C ) were strongly reacting with anti-c-myc antibodies, whereas 
those near amino acids 9 and 45 (S A $> C or S A n c ) were not. From these results we 
conclude that the former regions are facing the environment, while the N-terminal 
region is poorly accessible to antibodies and thus is either buried within SAN or is 
facing the S-layer pore or cell wall. 

Interestingly, mutant S A i3c» although non-functional in in vitro analyses, 
forms an integral part of the S-layer in Wvo, since the mutant protein could be 
extracted from L. acidophilus transformants by 5M LiCl. Moreover, a strong positive 
signal was observed in the FACS analysis implying that c-myc in S A n c is located at 
the bacterial surface. Since the S-layer of the transformants is comprised of wild type 
and mutant S A -protein, we conclude that S A is c can functionally interact with wild 
type S A -protein. 

The difference in distribution of the flow cytometry signals, most clearly 
detectable for L. acidophilus transformants producing Sai2c or Saisc, suggested a 
difference in mutant protein production levels within the population of bacteria. 
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Immunofluorescence microscopy and PCR experiments showed that in a fraction of 
the bacterial population the wild type S A -protein was fully replaced by the mutant 
protein as a result of the exchange of the chromosomal sip A gene by the plasmid 
encoded mutant allele. Since this phenomenon was also observed for S A i3c we 
assume that S A i3c can form a genuine S-layer in vivo. An explanation why Sauc 
forms an S-layer in vivo but not in vitro might be that interaction of S A -protein with 
the cell wall stabilises the interaction of the S A uc monomers, or induces a 
conformational change of the monomers facilitating their assembly. Also the absence 
of the His-tag in S A i3 C may positively affect its crystallisation. 

Based on the proteolysis experiments it was assumed that the central region 
divides the SAN domain into two parts that may constitute subdomains. To verify 
this assumption the properties of N- and C-terminally truncated SAN peptides were 
determined. Truncated forms of SAN could be produced in E, coli with a size of 12- 
19 kDa that were insensitive to proteases present in E. coli, when up to 170 C- 
terminal residues or up to 120 N-terminal residues were deleted. Both C-terminally 
and N-terrninally truncated peptides were capable of forming dimers in vitro that 
could resist boiling in the presence of 1% SDS. Apparently these fragments represent 
"functionally active units, as they can dimerise through protein-protein interaction 
under strongly denaturing conditions. Since all three C-terminally truncated peptides 
were proteolytically stable and could dimerise whereas only the largest N-terminally 
truncated form (SAN7) showed this behaviour, one can conclude that the subdomain 
boundaries are located between amino acids 114 and 140. 

The two subdomains described above could represent the two morphological 
subdomains observed in our electron crystallographic study of S A -protein and SAN. 
For each SAN monomer four contact points with neighbouring monomers could be 
observed, three in the elongated and one in the spherical subdomain (Smit et al. 9 
2001). Figure 6 shows a model which schematically depicts how crystals of SAN 
could be formed by dimerisation and oligomerisation of its subdomains. 

Finally, the experiments demonstrate how to surface-expose epitopes on the 
S-layer of Z. acidophilus. Based on this finding the presentation of other antigens on 
the surface of lactobacilli using the S-layer can be used in oral or nasal vaccines. 
Lactobacilli possess several additional properties that make them highly suitable for 
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such applications, i.e. their GRAS (generally regarded as safe) status, their 
immunomodulating properties and their capacity to evoke mucosal and systemic 
immune responses against associated antigens (Maassen et al, 1999- Shaw et al., 
2000). The observation that mutant S-layer genes can replace the chromosomal wild 
type gene by genetic recombination, rendering the mutation genetically stable, 
further adds to the application potential of genetically engineered Lactobacillus S- 
proteins. 



Materials and Methods 



Bacterial strains and growth conditions 

L. acidophilus ATCC 4356 and L. casei ATCC 393 were cultivated 
anaerobically in MRS broth (Difco) at 37°C. Lactococcus lactis MG1614 was 
cultivated aerobically in GM17 medium at 30°C. Escherichia coli M15 (pREP4) 
(Qiagen) and E. coli DH5a (Phabagen, The Netherlands) were cultivated aerobically 
in Luria (L) broth at 37°C. When necessary, media were supplemented with 1,5% 
agarose, 5 ug/ml (L. lactis) or 7.5 ug/ml (L. casei and L. acidophilus) 
chloramphenicol, 100 ug/ml ampicillin and/or 25 ug/ml kanamycin (E. coli). 

Linker insertion mutagenesis of theL. acidophilus slpA gene 

A pET5a (Promega) derived vector, pTs/pAlO (previously named pTAlO), 
containing a BamHI-Hindm PGR fragment encoding mature S A -protein (Smit et al., 
2001) was used for insertion mutagenesis. Double stranded DNA linkers were 
inserted in unique restriction sites or in sites occurring twice in the slpA gene (Figure 
la). Linkers consisted of Ncol, Notl zndXhoI (NcNoX) sites flanked by restriction 
site-specific 5' and 3' sticky ends. Ligation mixtures were introduced in E. coli 
DH5a and clones harbouring pTslpAlO vector with the desired linker insert, were 
identified by colony PCR and analysed by DNA sequencing. The resulting plasmids 
are indicated with pTslpA followed by the mutant number as indicated in Table 1 
(pTslpAl 1, pTslpA12 etc). 
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Functional analysis of Sj-protein insertion mutants in E. coli 

Mutant S A -protein genes were cloned in pQE30AXN as BamHL-HindTII 
fragments introducing an N-terminal six histidine tag (plasmids are indicated with 
pUslpA followed by mutant number). Mutant SA-protein gene expression and metal 
affinity purification were carried out as previously described (Smit et al., 2001). 
Purified mutant S A -protein (500 [xg/inl) was dialysed exhaustively against 50 mM 
Tris-HCl, pH 7.5 and checked for precipitation, which is indicative of S A -protein 
crystal formation (Smit et al., 2001). Soluble and precipitated S-protein fractions 
were separated by centrifugation and analysed by SDS-PAGE. Precipitated material 
was also analysed by electron microscopy (EM) (Smit et aL, 2001). In three mutant 
plasmids pHLs/pAl 1, pHslpAll and pHslpA!3 the NcNoX linker was removed by 
Ncol and XJtoI digestion and replaced by a c-myc epitope linker (Ncol-c-myc-XIioT). 
The mutant proteins were produced and analysed as described for the other mutants. 
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Construction of plasmid pHslpA9c 

An additional insertion mutant was constructed by PCR using vector pBKl 
(Boot et al., 1993) as template and primers 5'SLPA1 (5' 

GCGCGAATTCAGATCTATCGTGGTAAGTAATAGGACGTG 3': SEQ ID NO. 
18) and CMYCRE 

(5 ' C AGCGAATTCCTCGAGGTTTAAATCTTCTTCTGAAATTAACTTTTGTTC 
TGCGTTAATAGTAGTAGCAGCGC 3': SEQ ID NO. 19) yielding 5 9 slpA9c. This 
PCR product was introduced in vector pTslpA\6-3 (containing Sail, BaniKL and 
Xliol sites, after amino acid 7) as BgHL-XJiol fragment yielding pT5's/pA9c. For 
purification and functional analysis of the mutant S A -protein in E. coli the sip region 
of pT5's/pA9c was amplified using primers CExAMYCl 

(5 ' GGGGGGATCCGGTACCGCTACTACTATTAACGCAGAAC 3': SEQ ID 
NO. 20) and CEA2(5' 

CCCCGGATCCAAGCTTATCGAAGTATCAGAAGATCCTATT 3 5 : SEQ ID NO. 

21) and the BamHI-BstETI fragment was transferred to pELs/pAlO yielding pHslpA9c. 
Functional analysis of the purified mutant protein was performed as described above. 

Re-introduction of 5 ' sip A expression signals in mutants slpAllc, sip A 12c and 
slpAlSc 

A fragment including the 5' expression signals of the sip A gene (nt -190 to + 
150 relative to the ATG), flanked by BglH (5') and Hindm (3') sites, was amplified 
from plasmid pBKl (Boot et al, 1993) by PCR using primers 5'SLPAl (5' 
GCGCGAATTCAGATCTATCGTGGTAAGTAATAGGACGTG 3': SEQ ID NO. 

22) and 5'SLPA2 (5 5 GGGGAAGCTTCAGTAGTGCTACCAGCAGCAG 3': SEQ 
ID NO. 23). The PCR product was inserted in pGEM-T and after sequence 
confirmation excised with EcoRL and HindUI and inserted in pUC19 to give pS'slpA. 
Mutant SA-protein cassettes isolated from pTslpAl lc. pTslpAllc, and pTslpAISc as 
BstEU or Pstl-Hindni fragments were cloned in p5 'sip A to yield p5 5 sip A 1 1 c 5 
pS'slpAllo and p5"slpA\3c. Clones containing the complete sip cassettes were 
identified by colony PCR and restriction analysis. 
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Construction of expression vectors for mutant S A -protein genes in L. casei and L. 
acidophilus 



To achieve the synthesis of mutant S A -proteins S A 9c S Allc , S A i 2 c and S A13c in 
L. casei and L. acidophilus, the multi-host range vector pLP401-T was used. This 
vector contains the inducible promoter of the Lactobacillus amylovorus a-amylase 
gene (Pouwels et al, 2001). The complete sip cassettes were isolated from 
p15'slpA9c, p5'slpAl lc, p5'slpA12c and p5\y/p,413c as BgHL-Hindm fragments and 
cloned in BamYWHwdUI digested pLP401-T to yield vectors j>LPslpA9c, 
pLPslpAllc, pUPslpAUc and pLPslpAl3c. Ligation mixtures were introduced in L. 
lactis MG1614 and correct clones were selected by colony PCR. Highly purified 
plasmid DNA isolates were prepared by CsCl gradient centrifugation, digested with 
Notl to remove the Idh terminator, ligated and introduced in L. casei ATCC 393. 
Colony PCR was used to confirm the absence of the terminator sequence. 

Expression analysis of mutant S A -protein genes in L. casei and L. acidophilus 

L. casei and L. acidophilus transformants harbouring pLPslpA9c, 
pLPslpAl lc, pLPslpAllc and pLPslpA13c and untransformed L. casei ATCC 393 
and L. acidophilus ATCC 4356 were inoculated 1 : 100 from ON cultures into 50 ml 
MRS medium and incubated for 16 h at 37°C. Cells were collected by centrifugation 
(25 nun, 3,000 x g, 4°C) and proteins in the culture supernatant were precipitated 
with TCA. The cell pellet was washed once with physiological salt and subjected to 1 
M and 5 M Li CI extraction to remove surface associated proteins as described 
previously (Smit et al., 2001). The 1 M and 5 M LiCl extracts were dialysed against 
AD at 4°C. 

Exposition ofc-myc at the S-layer swface ofL. acidophilus transformants 

Exposition of the c-myc epitope on the S-layer surface by L. acidophilus was 
determined by flow cytometry and immunofluorescence microscopy. Transformants 
containing P LPslpA9c, pLPslpAl lc, pLPslpAUc and pL?slpA13c and wildtype L. 
acidophilus cells were harvested at the end of the log phase and washed once with 
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phosphate buffered saline (PBS) supplemented with 1 % (w/v) fetal calf serum 
(FCS). After centrifugation the cell pellet was resuspended in PBS-FCS 
supplemented with 1 % FCS containing anti-S A -protein or anti-c-myc (1 :5000 and 
1:50 diluted, respectively) antibodies. After incubation with FITC-labelled goat-anti- 
mouse antibodies, cells were analysed with a FACSCalibur flow cytometer (Becton 
Dickinson, San Jose, CA) as described (Pouwels et ai, 2001), or inspected with an 
immunofluorescence microscope. 

PCR detection of chromosomal sip recombinants 

Genomic DNA was isolated from transformants containing expression 
vectors pLPs/p^9c, pLPslpAl lc, pLPslpAllc and ipLPslpA\3c. PCR was performed 
according to standard procedures using primer A9 (forward, sip promoter-specific: 
5 ' -CTTGCTATTTCTTGAAGAG-3 ' : SEQ ID NO. 24) and primer CMYCINT 
(reverse, c-myc -insert-specific: 5 '-GCGTTTAAATCTTCTTCTGAA-3 ' : SEQ ID 
NO. 25) and 15 ng of chromosomal DNA as template. To avoid misinterpretation of 
the results due to a Polymerase Halt-mediated Linkage Of Primers or PHLOP event, 
PCR was also performed using purified chromosomal DNA obtained from wild type 
L. acidophilus which was mixed with purified pLP 'sip Allc DNA at physiological 
relevant molar ratios (chrDNA : plDNA ratio's of 1:0.3, 1:3, 1:30 and 1:300). 

Construction, purification and functional analysis of truncated SAN fragments 

Three DNA fragments encoding C-terminally truncated SAN peptides were 
amplified by PCR with pBKl (Boot et al., 1993) as template. Primer CEA1 (5 5 - 
GGGGGGATCCGGTACCGCTACTACTATTAACGCAAGTTC -3', forward: 
SEQ ID NO. 26) was used in combination with primer SANA1 (5'- 
CCCCGAATTC AAGCTTTTATTAGT- ATACGTTTGC AATTG AAAC ATT AG-3 ' , 
reverse: SEQ ID NO. 27) to amplify SAN2, SANA2 (5'- 

CCCCAAGCTTTTATT ATGAAGC AACACCGTTTTGGTC-3 5 , reverse: SEQ ID 
NO. 28) for SAN4 and SAN A3 (5 5 - 

CCCCAAGCTTTTATT AACCAAGGGTAAC AGTCTTACC-3 ' . reverse: SEQ ID 
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NO. 29) for SAN6. Fragments encoding three N-terminally truncated peptides were 
amplified using primer SANB1 (5 '-GGGGGGATCCGGTACCAAAGTTAAGT- 
TAGACCAAAACGGTG-3 ' : SEQ ID NO. 30) for SAN3, SANB2 (5'- 
GGGGGGATCCGGTACCC-TT-ACTAATGTTTC AATTGCAAACG-3 5 : SEQ ID 
NO. 31) for SANS and SANB3 (5'-GGGGGG- 

ATCCGGTACCTC AGCTAACTCAAATGTAAAATT-3 ' : SEQ ID NO. 32) for 
SAN7, all three in combination with primer SAN1 

(5 5 CCCCGAATTC AAGCTTTTATTAAATTC-TCTTGCTTAGCTGGGCT AC-3 \ 
reverse: SEQ ID NO. 33) and the same template. The fragments were cloned in 
pGEM-T and sequenced. The fragments encoding SAN2, SAN3, SAN4, SANS, 
S AN6 and SAN7 were excised as BamiH-HindDl fragments and introduced in vector 
pQE30AXN (Smit et aL, 2001) in frame with an N-tenninal six histidine tag. 
Expression of gene fragments and purification of the gene products was performed as 
described previously (Smit et aL, 2001). Functional analysis of the truncated SAN 
peptides consisted of dialysis of samples (500 jig/ml) against 50 mM Tris-HCl, pH 
7.5 followed by SDS-PAGE and western analysis. 

DNA manipulation, protein analysis 

DNA manipulations and protein analysis were performed following standard 
procedures (Laemmli, 1970; Sambrook et aL, 1989). Plasrnid DNA was isolated 
from Lactococcus/ Lactobacillus according to published procedures (Kok et al. 9 1984; 
Posno et aL, 1991). Genomic DNA was isolated from lactobacilli using 
cetyllximethylammonium bromide (CTAB) (Towner, 1991). Transformation ofi. 
lactis, L. casei and L. acidophilus was performed following published procedures 
(Kok et aL, 1984; Posno et al, 1991; Walker et aL 1996). Proteolytic treatment of 
SA-protein was performed as previously described (Smit et aL, 2001). 

Vaccine preparation and immunisation 

The capacity of recombinant L. acidophilus S-layer proteins to evoke an 
immune response against the inserted c-myc epitope was assessed for mutant 
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S-protein S A i 2c that exposes c-myc to the environment and for mutant S-protein Sa9 C 
in which c-myc is buried within the S-layer. Female C57B 1/6 and Balb/c mice were 
immunized three times a week with a 3-week interval between immunizations. 
Purified mutant S-protein (range 25-100g per dose) in phosphate buffered saline 
(pH7) or in Freund's incomplete adjuvant (Difco Laboratores) was adminitered 
intraperitoneally. Serum samples were collected 14 and 28 days past prime 
immunisation, and 7 and 14 days past booster immunisation. The induction of 
antibodies was determined in an ELISA using purified Sha 9 c Shai->c, and Shaio 
protein as probes, as well as a purified cmyc carrying recombinant protein (CD81- 
cmyc) as a positive control. The results indicate the recombinant L. acidophilus S- 
layer proteins with inserted antigens can evoke an immune response to both the 
protein itself (Table 2, positive response of all Sha proteins against SA10) as well as 
to the inserted antigen (Table 2, positive response against CD81-cmyc). 



Table 2 
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The proteins (50ul) were coated o/n onto plastic wells at 4°C at a 
concentration of 0-10ug/ml and after pretreatment with skimmed milk incubated with 
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anti-cmyc antiserum (Phaimingen). The recombinant protein CDSl-cmyc served as 
a positive control 

In a second experiment the capacity of L, acidophilus expressing Sai2c or 
Sha9c to induce antibodies against the c-myc epitope can be determined. lOOg of 
dead recombinant L. acidophilus, in phosphate buffered saline (pH 7) or in Freund's 
incomplete adjuvant (Difco Laboraties) is to be administered intraperitoneally in a 
2501 volume twice with a 4-week interval between immunisations. Serum samples 
are then collected 14 and 28 days past prime immunisation, and 7 and 14 days past 
booster immunisation. The induction of antibodies is determined in an ELISA using 
purified Sha9c protein as probe, as well as with purified c-myc. This can show that 
recombinant Lactobacilli carrying S-proteins with inserted antigens can be used as 
vaccine carriers. 
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